September 27, 2024

[2023] Databricks-Certified-Professional-Data-Engineer Exam Dumps, Test Engine Practice Test Questions [Q131-Q153]

Rate this post

[2023] Databricks-Certified-Professional-Data-Engineer Exam Dumps, Test Engine Practice Test Questions

Pass Databricks-Certified-Professional-Data-Engineer exam [Apr 08, 2023] Updated 220 Questions

Q131. A new data engineer [email protected] has been assigned to an ELT project. The new data
engineer will need full privileges on the table sales to fully manage the project.
Which of the following commands can be used to grant full permissions on the table to the new data engineer?

 
 
 
 
 

Q132. Which of the following scenarios is the best fit for the AUTO LOADER solution?

 
 
 
 
 

Q133. Which of the following statements can be used to test the functionality of code to test number of rows in the table equal to 10 in python?
row_count = spark.sql(“select count(*) from table”).collect()[0][0]

 
 
 
 
 

Q134. What is the purpose of gold layer in Multi hop architecture?

 
 
 
 
 

Q135. Which of the following benefits does Delta Live Tables provide for ELT pipelines over standard data pipelines
that utilize Spark and Delta Lake on Databricks?

 
 
 
 
 

Q136. You have accidentally deleted records from a table called transactions, what is the easiest way to restore the records deleted or the previous state of the table? Prior to deleting the version of the table is 3 and after delete the version of the table is 4.

 
 
 
 

Q137. You noticed a colleague is manually copying the data to the backup folder prior to running an up-date command, incase if the update command did not provide the expected outcome so he can use the backup copy to replace table, which Delta Lake feature would you recommend simplifying the process?

 
 
 
 
 

Q138. The Delta Live Table Pipeline is configured to run in Production mode using the continuous Pipe-line Mode.
what is the expected outcome after clicking Start to update the pipeline?

 
 
 
 
 

Q139. Which of the following programming languages can be used to build a Databricks SQL dashboard?

 
 
 
 
 

Q140. Which of the following SQL statement can be used to query a table by eliminating duplicate rows from the query results?

 
 
 
 
 

Q141. You are working on a marketing team request to identify customers with the same information between two tables CUSTOMERS_2021 and CUSTOMERS_2020 each table contains 25 columns with the same schema, You are looking to identify rows that match between two tables across all columns, which of the following can be used to perform in SQL

 
 
 
 
 

Q142. A data engineering team has created a series of tables using Parquet data stored in an external sys-tem. The
team is noticing that after appending new rows to the data in the external system, their queries within
Databricks are not returning the new rows. They identify the caching of the previous data as the cause of this
issue.
Which of the following approaches will ensure that the data returned by queries is always up-to-date?

 
 
 
 
 

Q143. A new data engineer has started at a company. The data engineer has recently been added to the company’s
Databricks workspace as [email protected]. The data engineer needs to be able to query the table
sales in the database retail. The new data engineer already has been granted USAGE on the database retail.
Which of the following commands can be used to grant the appropriate permissions to the new data engineer?

 
 
 
 
 

Q144. The marketing team is launching a new campaign to monitor the performance of the new campaign for the first two weeks, they would like to set up a dashboard with a refresh schedule to run every 5 minutes, which of the below steps can be taken to reduce of the cost of this refresh over time?

 
 
 
 
 

Q145. What could be the expected output of query SELECT COUNT (DISTINCT *) FROM user on this table

 
 
 
 
 

Q146. You are currently working on a notebook that will populate a reporting table for downstream process consumption, this process needs to run on a schedule every hour, what type of cluster are you going to use to set up this job?

 
 
 
 

Q147. Your team has hundreds of jobs running but it is difficult to track cost of each job run, you are asked to provide a recommendation on how to monitor and track cost across various workloads

 
 
 
 
 

Q148. Which of the following approaches can the data engineer use to obtain a version-controllable con-figuration of the Job’s schedule and configuration?

 
 
 
 
 

Q149. If you create a database sample_db with the statement CREATE DATABASE sample_db what will be the default location of the database in DBFS?

 
 
 
 
 

Q150. Drop the customers database and associated tables and data, all of the tables inside the database are managed tables. Which of the following SQL commands will help you accomplish this?

 
 
 
 
 

Q151. The data engineering team is using a bunch of SQL queries to review data quality and monitor the ETL job every day, which of the following approaches can be used to set up a schedule and auto-mate this process?

 
 
 
 
 

Q152. You were asked to create a notebook that can take department as a parameter and process the data accordingly, which is the following statements result in storing the notebook parameter into a py-thon variable

 
 
 
 
 

Q153. Which of the following technique can be used to implement fine-grained access control to rows and columns of the Delta table based on the user’s access?

 
 
 
 
 

Databricks Databricks-Certified-Professional-Data-Engineer Real 2023 Braindumps Mock Exam Dumps: https://www.prepawaypdf.com/Databricks/Databricks-Certified-Professional-Data-Engineer-practice-exam-dumps.html

Leave a Reply

Your email address will not be published. Required fields are marked *

Enter the text from the image below