Databricks-Certified-Data-Engineer-Professional Free Question Set: "Databricks Certified Data Engineer Professional"

Page: 1 / 13
Total: 127 questions

Question 1

A small company based in the United States has recently contracted a consulting firm in India to implement several new data engineering pipelines to power artificial intelligence applications. All the company's data is stored in regional cloud storage in the United States.
The workspace administrator at the company is uncertain about where the Databricks workspace used by the contractors should be deployed.
Assuming that all data governance considerations are accounted for, which statement accurately informs this decision?

（A）Databricks runs HDFS on cloud volume storage; as such, cloud virtual machines must be deployed in the region where the data is stored.

（B）Databricks notebooks send all executable code from the user's browser to virtual machines over the open internet; whenever possible, choosing a workspace region near the end users is the most secure.

（C）Cross-region reads and writes can incur significant costs and latency; whenever possible, compute should be deployed in the same region the data is stored.

（D）Databricks workspaces do not rely on any regional infrastructure; as such, the decision should be made based upon what is most convenient for the workspace administrator.

（E）Databricks leverages user workstations as the driver during interactive development; as such, users should always use a workspace deployed in a region they are physically near.

Correct answer: C

Question 2

The data engineering team is migrating an enterprise system with thousands of tables and views into the Lakehouse. They plan to implement the target architecture using a series of bronze, silver, and gold tables. Bronze tables will almost exclusively be used by production data engineering workloads, while silver tables will be used to support both data engineering and machine learning workloads. Gold tables will largely serve business intelligence and reporting purposes. While personal identifying information (PII) exists in all tiers of data, pseudonymization and anonymization rules are in place for all data at the silver and gold levels.
The organization is interested in reducing security concerns while maximizing the ability to collaborate across diverse teams.
Which statement exemplifies best practices for implementing this system?

（A）Working in the default Databricks database provides the greatest security when working with managed tables, as these will be created in the DBFS root.

（B）Isolating tables in separate databases based on data quality tiers allows for easy permissions management through database ACLs and allows physical separation of default storage locations for managed tables.

（C）Because databases on Databricks are merely a logical construct, choices around database organization do not impact security or discoverability in the Lakehouse.

（D）Storing all production tables in a single database provides a unified view of all data assets available throughout the Lakehouse, simplifying discoverability by granting all users view privileges on this database.

（E）Because all tables must live in the same storage containers used for the database they're created in, organizations should be prepared to create between dozens and thousands of databases depending on their data isolation requirements.

Correct answer: B
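
Option B can be pictured as one database per quality tier, each with its own default storage location and its own grants. The following is a minimal sketch that only builds the SQL text. The tier paths and group names are hypothetical, and the GRANT wording assumes legacy table access control syntax; a workspace governed differently may use other privilege names.

```python
# Hypothetical storage paths and group names, for illustration only.
TIERS = {
    "bronze": {"location": "/mnt/lakehouse/bronze", "readers": ["data-engineers"]},
    "silver": {"location": "/mnt/lakehouse/silver",
               "readers": ["data-engineers", "ml-engineers"]},
    "gold": {"location": "/mnt/lakehouse/gold", "readers": ["bi-analysts"]},
}


def tier_statements(tiers):
    """Build a CREATE DATABASE statement plus per-group grants for each tier."""
    statements = []
    for name, cfg in tiers.items():
        location = cfg["location"]
        # A separate LOCATION per database keeps managed tables physically apart.
        statements.append(
            f"CREATE DATABASE IF NOT EXISTS {name} LOCATION '{location}'"
        )
        for group in cfg["readers"]:
            # Assumed legacy table ACL syntax; one privilege per statement.
            statements.append(f"GRANT USAGE ON DATABASE {name} TO `{group}`")
            statements.append(f"GRANT SELECT ON DATABASE {name} TO `{group}`")
    return statements


if __name__ == "__main__":
    for stmt in tier_statements(TIERS):
        print(stmt)  # on Databricks, each string could be passed to spark.sql()
```

Keeping the permissions at database level means a new table inherits the tier's access rules without a per-table grant.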

Question 3

A junior data engineer is migrating a workload from a relational database system to the Databricks Lakehouse. The source system uses a star schema, leveraging foreign key constraints and multi-table inserts to validate records on write.
Which consideration will impact the decisions made by the engineer while migrating this workload?

（A）Databricks supports Spark SQL and JDBC; all logic can be directly migrated from the source system without refactoring.

（B）Committing to multiple tables simultaneously requires taking out multiple table locks and can lead to a state of deadlock.

（C）All Delta Lake transactions are ACID compliant against a single table, and Databricks does not enforce foreign key constraints.

（D）Databricks only allows foreign key constraints on hashed identifiers, which avoid collisions in highly-parallel writes.

（E）Foreign keys must reference a primary key field; multi-table inserts must leverage Delta Lake's upsert functionality.

Correct answer: C

Question 4

The Databricks workspace administrator has configured interactive clusters for each of the data engineering groups. To control costs, clusters are set to terminate after 30 minutes of inactivity.
Each user should be able to execute workloads against their assigned clusters at any time of the day.
Assuming users have been added to a workspace but not granted any permissions, which of the following describes the minimal permissions a user would need to start and attach to an already configured cluster?

（A）Cluster creation allowed. "Can Attach To" privileges on the required cluster

（B）"Can Restart" privileges on the required cluster

（C）Cluster creation allowed. "Can Restart" privileges on the required cluster

（D）"Can Manage" privileges on the required cluster

（E）Workspace Admin privileges, cluster creation allowed. "Can Attach To" privileges on the required cluster

Correct answer: B
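
As a rough illustration of granting only "Can Restart", the sketch below builds the request path and JSON body for a cluster permissions update without sending anything. It assumes the Databricks Permissions REST API shape (`/api/2.0/permissions/clusters/{cluster_id}` with an `access_control_list`); the cluster ID and user name are hypothetical.

```python
import json

# Cluster-level permission names as used in the answer options.
CLUSTER_LEVELS = {"CAN_ATTACH_TO", "CAN_RESTART", "CAN_MANAGE"}


def cluster_permission_request(cluster_id, user_name, level="CAN_RESTART"):
    """Return (path, json_body) for granting one user one cluster permission."""
    if level not in CLUSTER_LEVELS:
        raise ValueError(f"unknown cluster permission level: {level}")
    # Assumed endpoint shape; nothing is sent over the network here.
    path = f"/api/2.0/permissions/clusters/{cluster_id}"
    body = {
        "access_control_list": [
            {"user_name": user_name, "permission_level": level}
        ]
    }
    return path, json.dumps(body)


if __name__ == "__main__":
    # Hypothetical cluster ID and user.
    path, body = cluster_permission_request("0123-456789-abcde", "user@example.com")
    print(path)
    print(body)
```

The default level matches the answer above: restarting a terminated cluster and attaching to it does not require cluster creation rights or "Can Manage".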

Question 5

A Delta Lake table was created with the below query:

Consider the following query:
DROP TABLE prod.sales_by_store
If this statement is executed by a workspace admin, which result will occur?

（A）The table will be removed from the catalog and the data will be deleted.

（B）An error will occur because Delta Lake prevents the deletion of production data.

（C）Nothing will occur until a COMMIT command is executed.

（D）Data will be marked as deleted but still recoverable with Time Travel.

（E）The table will be removed from the catalog but the data will remain in storage.

Correct answer: A

Question 6

An upstream system has been configured to pass the date for a given batch of data to the Databricks Jobs API as a parameter. The notebook to be scheduled will use this parameter to load data with the following code:
df = spark.read.format("parquet").load(f"/mnt/source/{date}")
Which code block should be used to create the date Python variable used in the above code block?

（A）import sys
date = sys.argv[1]

（B）dbutils.widgets.text("date", "null")
date = dbutils.widgets.get("date")

（C）date = spark.conf.get("date")

（D）date = dbutils.notebooks.getParam("date")

（E）input_dict = input()
date = input_dict["date"]

Correct answer: B
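
The widget pattern in option B declares a text widget with a default and then reads it, so a value passed by the job overrides the default. Because `dbutils` exists only inside a Databricks notebook, the sketch below uses a hypothetical `StubWidgets` class that mimics just the `text` and `get` calls, so the flow can be run locally.

```python
class StubWidgets:
    """Hypothetical local stand-in for the two dbutils.widgets calls used here."""

    def __init__(self, passed_params=None):
        self._passed = dict(passed_params or {})  # values supplied by the job run
        self._defaults = {}

    def text(self, name, default_value):
        # Declares the widget and records its default value.
        self._defaults[name] = default_value

    def get(self, name):
        # A value passed to the run takes precedence over the default.
        return self._passed.get(name, self._defaults[name])


def source_path(widgets, base="/mnt/source"):
    """Same two steps as option B, followed by the path from the question."""
    widgets.text("date", "null")
    date = widgets.get("date")
    return f"{base}/{date}"


if __name__ == "__main__":
    print(source_path(StubWidgets({"date": "2024-01-15"})))
    print(source_path(StubWidgets()))
```

In a notebook, `dbutils.widgets` would take the place of the stub object.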

Question 7

A junior data engineer is working to implement logic for a Lakehouse table named silver_device_recordings. The source data contains 100 unique fields in a highly nested JSON structure.
The silver_device_recordings table will be used downstream to power several production monitoring dashboards and a production model. At present, 45 of the 100 fields are being used in at least one of these applications.
The data engineer is trying to determine the best approach for dealing with schema declaration given the highly-nested structure of the data and the numerous fields.
Which of the following accurately presents information about Delta Lake and Databricks that may impact their decision-making process?

（A）Because Delta Lake uses Parquet for data storage, data types can be easily evolved by just modifying file footer information in place.

（B）Because Databricks will infer schema using types that allow all observed data to be processed, setting types manually provides greater assurance of data quality enforcement.

（C）The Tungsten encoding used by Databricks is optimized for storing string data; newly-added native support for querying JSON strings means that string types are always most efficient.

（D）Human labor in writing code is the largest cost associated with data engineering workloads; as such, automating table declaration logic should be a priority in all migration workloads.

（E）Schema inference and evolution on Databricks ensure that inferred types will always accurately match the data types used by downstream systems.

Correct answer: B
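
One way to set types manually, as option B suggests, is to declare only the fields that downstream consumers use and pass them as a DDL-formatted schema string. This is a sketch under stated assumptions: the field names and types below are hypothetical and do not come from the silver_device_recordings source.

```python
# Hypothetical subset of the fields in use; names and types are illustrative.
USED_FIELDS = {
    "device_id": "STRING",
    "recorded_at": "TIMESTAMP",
    "reading": "STRUCT<temperature: DOUBLE, unit: STRING>",
}


def to_ddl(fields):
    """Join field names and types into a DDL-formatted schema string."""
    return ", ".join(f"{name} {dtype}" for name, dtype in fields.items())


if __name__ == "__main__":
    schema_ddl = to_ddl(USED_FIELDS)
    print(schema_ddl)
    # On a cluster, a DDL string like this can be handed to the reader, e.g.
    # spark.read.schema(schema_ddl).json(path), instead of relying on inference.
```

Declaring types this way makes a mismatch in the source visible rather than letting inference widen the column to a more permissive type.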

Question 8

A data architect has designed a system in which two Structured Streaming jobs will concurrently write to a single bronze Delta table. Each job is subscribing to a different topic from an Apache Kafka source, but they will write data with the same schema. To keep the directory structure simple, a data engineer has decided to nest a checkpoint directory to be shared by both streams.
The proposed directory structure is displayed below:

Which statement describes whether this checkpoint directory structure is valid for the given scenario and why?

（A）Yes; both of the streams can share a single checkpoint directory.

（B）No; each of the streams needs to have its own checkpoint directory.

（C）No; Delta Lake manages streaming checkpoints in the transaction log.

（D）No; only one stream can write to a Delta Lake table.

（E）Yes; Delta Lake supports infinite concurrent writers.

Correct answer: B
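
Giving each stream its own checkpoint directory can be as simple as deriving the path from the stream's name. The helper below is a minimal sketch; the checkpoint root and stream names are hypothetical and are not the directory structure referenced in the question.

```python
import posixpath


def checkpoint_paths(checkpoint_root, stream_names):
    """Map each stream name to its own checkpoint directory."""
    if len(set(stream_names)) != len(stream_names):
        # Two streams with the same name would end up sharing a checkpoint.
        raise ValueError("stream names must be unique")
    return {
        name: posixpath.join(checkpoint_root, name) for name in stream_names
    }


if __name__ == "__main__":
    # Hypothetical root and stream names.
    paths = checkpoint_paths("/mnt/checkpoints/bronze", ["topic_a", "topic_b"])
    for name, path in paths.items():
        # Each writer would use its own path, e.g.
        # .writeStream.option("checkpointLocation", path)
        print(name, path)
```

Both streams can still target the same bronze table; only the checkpoint locations have to differ.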

Question 9

The DevOps team has configured a production workload as a collection of notebooks scheduled to run daily using the Jobs UI. A new data engineering hire is onboarding to the team and has requested access to one of these notebooks to review the production logic.
What are the maximum notebook permissions that can be granted to the user without allowing accidental changes to production code or data?

（A）No permissions

（B）Can Read

（C）Can Manage

（D）Can Edit

（E）Can Run

Correct answer: B

Question 10

What statement is true regarding the retention of job run history?

（A）It is retained for 30 days, during which time you can deliver job run logs to DBFS or S3

（B）It is retained for 90 days or until the run-id is re-used through custom run configuration

（C）It is retained until you export or delete job run logs

（D）It is retained for 60 days, during which you can export notebook run results to HTML

（E）It is retained for 60 days, after which logs are archived

Correct answer: D
