Databricks-Certified-Data-Engineer-Associate 無料問題集「Databricks Certified Data Engineer Associate」
A data engineer has a single-task Job that runs each morning before they begin working. After identifying an upstream data issue, they need to set up another task to run a new notebook prior to the original task.
Which of the following approaches can the data engineer use to set up the new task?
Which of the following approaches can the data engineer use to set up the new task?
正解:A
解答を投票する
解説: (JPNTest メンバーにのみ表示されます)
A data engineer has a Python notebook in Databricks, but they need to use SQL to accomplish a specific task within a cell. They still want all of the other cells to use Python without making any changes to those cells.
Which of the following describes how the data engineer can use SQL within a cell of their Python notebook?
Which of the following describes how the data engineer can use SQL within a cell of their Python notebook?
正解:E
解答を投票する
解説: (JPNTest メンバーにのみ表示されます)
A data engineer has left the organization. The data team needs to transfer ownership of the data engineer's Delta tables to a new data engineer. The new data engineer is the lead engineer on the data team.
Assuming the original data engineer no longer has access, which of the following individuals must be the one to transfer ownership of the Delta tables in Data Explorer?
Assuming the original data engineer no longer has access, which of the following individuals must be the one to transfer ownership of the Delta tables in Data Explorer?
正解:C
解答を投票する
解説: (JPNTest メンバーにのみ表示されます)
A data engineer has realized that the data files associated with a Delta table are incredibly small. They want to compact the small files to form larger files to improve performance.
Which of the following keywords can be used to compact the small files?
Which of the following keywords can be used to compact the small files?
正解:D
解答を投票する
解説: (JPNTest メンバーにのみ表示されます)
An engineering manager wants to monitor the performance of a recent project using a Databricks SQL query. For the first week following the project's release, the manager wants the query results to be updated every minute. However, the manager is concerned that the compute resources used for the query will be left running and cost the organization a lot of money beyond the first week of the project's release.
Which of the following approaches can the engineering team use to ensure the query does not cost the organization any money beyond the first week of the project's release?
Which of the following approaches can the engineering team use to ensure the query does not cost the organization any money beyond the first week of the project's release?
正解:D
解答を投票する
解説: (JPNTest メンバーにのみ表示されます)
A data engineer is designing a data pipeline. The source system generates files in a shared directory that is also used by other processes. As a result, the files should be kept as is and will accumulate in the directory. The data engineer needs to identify which files are new since the previous run in the pipeline, and set up the pipeline to only ingest those new files with each run.
Which of the following tools can the data engineer use to solve this problem?
Which of the following tools can the data engineer use to solve this problem?
正解:C
解答を投票する
解説: (JPNTest メンバーにのみ表示されます)