次の認定試験に速く合格する！

簡単に認定試験を準備し、学び、そして合格するためにすべてが必要だ。

会員センターカート (0)

DEA-C01 無料問題集「Snowflake SnowPro Advanced: Data Engineer Certification」

ページ: 1 / 14
トータル 132 問

サインアップ、ログインされた後に、試験全体を無料で表示できるようになります。

質問 1

A financial company wants to implement a data mesh. The data mesh must support centralized data governance, data analysis, and data access control. The company has decided to use AWS Glue for data catalogs and extract, transform, and load (ETL) operations.
Which combination of AWS services will implement a data mesh? (Choose two.)

（A）Use Amazon S3 for data storage. Use Amazon Athena for data analysis.

（B）Use Amazon Aurora for data storage. Use an Amazon Redshift provisioned cluster for data analysis.

（C）Use AWS Lake Formation for centralized data governance and access control.

（D）Use AWS Glue DataBrew for centralized data governance and access control.

（E）Use Amazon RDS for data storage. Use Amazon EMR for data analysis.

正解：A、C 解答を投票する

質問 2

A company needs to set up a data catalog and metadata management for data sources that run in the AWS Cloud. The company will use the data catalog to maintain the metadata of all the objects that are in a set of data stores. The data stores include structured sources such as Amazon RDS and Amazon Redshift. The data stores also include semistructured sources such as JSON files and .xml files that are stored in Amazon S3.
The company needs a solution that will update the data catalog on a regular basis. The solution also must detect changes to the source metadata.
Which solution will meet these requirements with the LEAST operational overhead?

（A）Use the AWS Glue Data Catalog as the central metadata repository. Extract the schema for Amazon RDS and Amazon Redshift sources, and build the Data Catalog. Use AWS Glue crawlers for data that is in Amazon S3 to infer the schema and to automatically update the Data Catalog.

（B）Use the AWS Glue Data Catalog as the central metadata repository. Use AWS Glue crawlers to connect to multiple data stores and to update the Data Catalog with metadata changes. Schedule the crawlers to run periodically to update the metadata catalog.

（C）Use Amazon Aurora as the data catalog. Create AWS Lambda functions that will connect to the data catalog. Configure the Lambda functions to gather the metadata information from multiple sources and to update the Aurora data catalog. Schedule the Lambda functions to run periodically.

（D）Use Amazon DynamoDB as the data catalog. Create AWS Lambda functions that will connect to the data catalog. Configure the Lambda functions to gather the metadata information from multiple sources and to update the DynamoDB data catalog. Schedule the Lambda functions to run periodically.

正解：B 解答を投票する

質問 3

What is the primary purpose of data lineage in data engineering?

（A）To optimize query performance.

（B）To trace the source and flow of data.

（C）To transform data formats.

（D）To create visualizations.

正解：B 解答を投票する

質問 4

A data engineer must build an extract, transform, and load (ETL) pipeline to process and load data from 10 source systems into 10 tables that are in an Amazon Redshift database. All the source systems generate .csv, JSON, or Apache Parquet files every 15 minutes. The source systems all deliver files into one Amazon S3 bucket. The file sizes range from 10 MB to 20 GB.
The ETL pipeline must function correctly despite changes to the data schema.
Which data pipeline solutions will meet these requirements? (Choose two.)

（A）Configure an AWS Lambda function to invoke an AWS Glue crawler when a file is loaded into the S3 bucket. Configure an AWS Glue job to process and load the data into the Amazon Redshift tables. Create a second Lambda function to run the AWS Glue job. Create an Amazon EventBridge rule to invoke the second Lambda function when the AWS Glue crawler finishes running successfully.

（B）Use an Amazon EventBridge rule to invoke an AWS Glue workflow job every 15 minutes.
Configure the AWS Glue workflow to have an on-demand trigger that runs an AWS Glue crawler and then runs an AWS Glue job when the crawler finishes running successfully. Configure the AWS Glue job to process and load the data into the Amazon Redshift tables.

（C）Configure an AWS Lambda function to invoke an AWS Glue workflow when a file is loaded into the S3 bucket. Configure the AWS Glue workflow to have an on-demand trigger that runs an AWS Glue crawler and then runs an AWS Glue job when the crawler finishes running successfully. Configure the AWS Glue job to process and load the data into the Amazon Redshift tables.

（D）Configure an AWS Lambda function to invoke an AWS Glue job when a file is loaded into the S3 bucket. Configure the AWS Glue job to read the files from the S3 bucket into an Apache Spark DataFrame. Configure the AWS Glue job to also put smaller partitions of the DataFrame into an Amazon Kinesis Data Firehose delivery stream. Configure the delivery stream to load data into the Amazon Redshift tables.

（E）Use an Amazon EventBridge rule to run an AWS Glue job every 15 minutes. Configure the AWS Glue job to process and load the data into the Amazon Redshift tables.

正解：B、C 解答を投票する

質問 5

A company uses an Amazon QuickSight dashboard to monitor usage of one of the company's applications. The company uses AWS Glue jobs to process data for the dashboard. The company stores the data in a single Amazon S3 bucket. The company adds new data every day.
A data engineer discovers that dashboard queries are becoming slower over time. The data engineer determines that the root cause of the slowing queries is long-running AWS Glue jobs.
Which actions should the data engineer take to improve the performance of the AWS Glue jobs?
(Choose two.)

（A）Partition the data that is in the S3 bucket. Organize the data by year, month, and day.

（B）Modify the IAM role that grants access to AWS glue to grant access to all S3 features.

（C）Adjust AWS Glue job scheduling frequency so the jobs run half as many times each day.

（D）Convert the AWS Glue schema to the DynamicFrame schema class.

（E）Increase the AWS Glue instance size by scaling up the worker type.

正解：A、E 解答を投票する

質問 6

A company uses Amazon EMR as an extract, transform, and load (ETL) pipeline to transform data that comes from multiple sources. A data engineer must orchestrate the pipeline to maximize performance.
Which AWS service will meet this requirement MOST cost effectively?

（A）AWS Step Functions

（B）Amazon Managed Workflows for Apache Airflow (Amazon MWAA)

（C）AWS Glue Workflows

（D）Amazon EventBridge

正解：A 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 7

A data engineer creates an AWS Lambda function that an Amazon EventBridge event will invoke.
When the data engineer tries to invoke the Lambda function by using an EventBridge event, an AccessDeniedException message appears.
How should the data engineer resolve the exception?

（A）Ensure that the subnet where the Lambda function is deployed is configured to be a private subnet.

（B）Ensure that both the IAM role that EventBridge uses and the Lambda function's resource-based policy have the necessary permissions.

（C）Ensure that EventBridge schemas are valid and that the event mapping configuration is correct.

（D）Ensure that the trust policy of the Lambda function execution role allows EventBridge to assume the execution role.

正解：B 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 8

A marketing company collects clickstream data. The company sends the clickstream data to Amazon Kinesis Data Firehose and stores the clickstream data in Amazon S3. The company wants to build a series of dashboards that hundreds of users from multiple departments will use.
The company will use Amazon QuickSight to develop the dashboards. The company wants a solution that can scale and provide daily updates about clickstream activity.
Which combination of steps will meet these requirements MOST cost-effectively? (Choose two.)

（A）Access the query data through QuickSight SPICE (Super-fast, Parallel, In-memory Calculation Engine). Configure a daily refresh for the dataset.

（B）Access the query data through a QuickSight direct SQL query.

（C）Use Amazon Athena to query the clickstream data

（D）Use Amazon Redshift to store and query the clickstream data.

（E）Use Amazon S3 analytics to query the clickstream data.

正解：A、C 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 9

A company plans to use Amazon Kinesis Data Firehose to store data in Amazon S3. The source data consists of 2 MB .csv files. The company must convert the .csv files to JSON format. The company must store the files in Apache Parquet format.
Which solution will meet these requirements with the LEAST development effort?

（A）Use Kinesis Data Firehose to convert the .csv files to JSON and to store the files in Parquet format.

（B）Use Kinesis Data Firehose to invoke an AWS Lambda function that transforms the .csv files to JSON. Use Kinesis Data Firehose to store the files in Parquet format.

（C）Use Kinesis Data Firehose to invoke an AWS Lambda function that transforms the .csv files to JSON and stores the files in Parquet format.

（D）Use Kinesis Data Firehose to convert the .csv files to JSON. Use an AWS Lambda function to store the files in Parquet format.

正解：A 解答を投票する

解説: (JPNTest メンバーにのみ表示されます)

質問 10

A company stores datasets in JSON format and .csv format in an Amazon S3 bucket. The company has Amazon RDS for Microsoft SQL Server databases, Amazon DynamoDB tables that are in provisioned capacity mode, and an Amazon Redshift cluster. A data engineering team must develop a solution that will give data scientists the ability to query all data sources by using syntax similar to SQL.
Which solution will meet these requirements with the LEAST operational overhead?

（A）Use AWS Glue to crawl the data sources. Store metadata in the AWS Glue Data Catalog. Use Redshift Spectrum to query the data. Use SQL for structured data sources. Use PartiQL for data that is stored in JSON format.

（B）Use AWS Glue to crawl the data sources. Store metadata in the AWS Glue Data Catalog. Use Amazon Athena to query the data. Use SQL for structured data sources. Use PartiQL for data that is stored in JSON format.

（C）Use AWS Lake Formation to create a data lake. Use Lake Formation jobs to transform the data from all data sources to Apache Parquet format. Store the transformed data in an S3 bucket. Use Amazon Athena or Redshift Spectrum to query the data.

（D）Use AWS Glue to crawl the data sources. Store metadata in the AWS Glue Data Catalog. Use AWS Glue jobs to transform data that is in JSON format to Apache Parquet or .csv format. Store the transformed data in an S3 bucket. Use Amazon Athena to query the original and transformed data from the S3 bucket.

正解：B 解答を投票する

ページ: 1 / 14
トータル 132 問

DEA-C01 の機能をすべて解除する

キャプチャ不要
365日無料更新サービス
希望する合格率を設定できる
時間の割り当てられる（時間：分）
DEA-C01 に2つの練習用モード
サポートサービス対応

完全版を入手する

弊社を連絡する

我々は１２時間以内ですべてのお問い合わせを答えます。

オンラインサポート時間：( UTC+9 ) 9:00-24:00
月曜日から土曜日まで

サポート：現在連絡

トップ試験

C_S4EWM_2023 試験問題集
300-620 試験問題集
1Z0-1123-25 試験問題集
1Z0-1084-25 試験問題集
HCVA0-003 試験問題集
S2000-024 試験問題集

DEA-C01 無料問題集「Snowflake SnowPro Advanced: Data Engineer Certification」

弊社を連絡する

関連リンク

トップ試験