Data Engineering on Microsoft Azure Test 11

Welcome to your Data Engineering on Microsoft Azure - Test-11

Q1. Scenario: You are working on a project and are preparing to ingest data from a SQL Server database hosted on an on-premises Windows Server.

Which integration runtime is required for Azure Data Factory to ingest data from the on-premises server?

Select 1 option(s):

Azure-SSIS Integration Runtime

Azure Integration Runtime

On-demand HDInsight cluster

Self-Hosted Integration Runtime

None of the listed options.

Q2. Scenario: You are working on a project where you create a DataFrame which is designed to read data from Azure Blob Storage. Next, you plan to create an additional DataFrame by filtering the initial DataFrame.

Which feature of Spark causes these transformations to be analyzed?

Select 1 option(s):

Tungsten Record Format

Lazy Execution

Java Garbage Collection

Cluster configuration

Q3. While Agile, CI/CD, and DevOps are different, they support one another.

Which is best described by:
“Focuses on culture highlighting roles that emphasize.”

Select 1 option(s):

DevOps

SDLC

Agile

CI/CD

Q4. Now that you know the different concepts of Spark, it is important to understand how it fits in with the different data services on Azure.

Which of the following is best described by:
“An open-source, memory-optimized system for managing big data workloads, used when you want a Spark engine for big data processing or data science and don’t mind that no SLA is provided. It is usually of interest to open-source professionals, and the reason for this product is to overcome the limitations of SMP systems for big data workloads.”

Select 1 option(s):

Azure Databricks

HDI

Spark Pools in Azure Synapse Analytics

Apache Spark

Q5. Which of the following tools is used to create and deploy SQL Server Integration Services (SSIS) packages on an Azure-SSIS integration runtime, or for on-premises SQL Server?

Select 1 option(s):

SQL Server Data Tools

SQL Server Management Studio

dtexec

Data Migration Service

SQL Server Upgrade Advisor

Data Migration Assistant

Q6. What is the Databricks Delta command to display metadata?

Select 1 option(s):

SHOW SCHEMA tablename

DESCRIBE DETAIL tableName

METADATA SHOW tablename

MSCK DETAIL tablename

Q7. What is a lambda architecture and what does it try to solve?

Select 1 option(s):

An architecture that splits incoming data into two paths - a batch path and a streaming path. This architecture helps address the need to provide real-time processing in addition to slower batch computations.

An architecture that defines a data processing pipeline whereby microservices act as compute resources for efficient large-scale data processing.

An architecture that employs the latest Scala runtimes in one or more Databricks clusters to provide the most efficient data processing platform available today.

None of the listed options.

Q8. Scenario: You work at a financial services firm where only account managers are allowed to access a customer’s social insurance number, phone numbers, or other personally identifiable information. It is imperative to distinguish the role of an account manager from that of the manager of the account managers.

Which type of security would typically be best used for this scenario?

Select 1 option(s):

Dynamic Data Masking

Table-level security

Row-level security

Column-level security

Q9. When you want to switch to SparkSQL in a notebook, what is the first command to type?

Select 1 option(s):

%%sparksql

%%csharp

%%sql

%%pyspark

%%spark

Q10. Monitoring is a key part of any mission-critical workload. It helps to proactively detect and prevent issues that might otherwise cause application or service downtime.

You can monitor Azure Stream Analytics jobs by using which of the following? (Select all that apply)

Select 4 option(s):

Diagnostic logs

Real-time dashboards that show service and application health trends

Predictive dashboards that show expected service and application health status

Alerts on issues in applications or services

An activity log for each running job

Q11. Which data processing framework will a data engineer use to ingest data onto cloud data platforms in Azure?

Select 1 option(s):

Online transaction processing (OLTP)

Atomicity, Consistency, Isolation, and Durability (ACID)

Extract, load, and transform (ELT)

Extract, transform, and load (ETL)

Automated Data Processing Equipment (ADPE)

Q12. Azure Storage provides a REST API to work with the containers and data stored in each account. Client libraries can save a significant amount of work for app developers because the API is tested and it often provides nicer wrappers around the data models sent and received by the REST API. Microsoft has Azure client libraries that support a number of languages and frameworks.

Which are Azure Storage supported languages and frameworks? (Select all that apply)

Select 6 option(s):

Java

Python

.NET

Node.js

Q13. It is a good practice to store documentation about a data source.

Which Azure service is the best choice to do this?

Select 1 option(s):

Azure Databricks

Azure Data Lake Storage

Azure Data Factory

Azure Stream Analytics

Azure Data Catalog

Q14. Scenario: You are determining the type of Azure service needed to fit the following specifications and requirements:

Data classification: Semi-structured because of the need to extend or modify the schema for new products
Operations:
• Customers require a high number of read operations, with the ability to query many fields within the database.
• The business requires a high number of write operations to track its constantly changing inventory.
Latency & throughput: High throughput and low latency.
Transactional support: Because all of the data is both historical and yet changing, transactional support is required.
Which would be the best Azure service to select?

Select 1 option(s):

Azure Blob Storage

Azure Queue Storage

Azure Cosmos DB

Azure Route Table

Azure SQL Database

Q15. On initial deployment of Azure Synapse Analytics, a few resources deploy along with it.

Which of the following are deployed along with Azure Synapse Analytics? (Select all that apply)

Select 2 option(s):

Azure Data Lake Storage Gen2

Azure Queue Storage

Azure Kubernetes Service

Azure Synapse Workspace

Azure Machine Learning

Q16. Scenario: Your team is working on a project using the Azure Data Factory authoring tool.

A junior team member comes to you and asks “Where can I find the Copy Data activity?”
Which of the below is the correct location?

Select 1 option(s):

Data Explorer

Azure Function

Batch Service

Move & Transform

Databricks

Q17. Scenario: You are working as a consultant at Advanced Idea Mechanics (A.I.M.), a privately funded think tank made up of brilliant scientists whose sole dedication is to acquire and develop power through technological means. Their goal is to use this power to overthrow the governments of the world. They supply arms and technology to radicals and subversive organizations in order to foster a violent technological revolution of society while making a profit.

The company has 10,000 employees. Most employees are located in Europe. The company supports teams worldwide.
AIM has two main locations: a main office in London, England, and a manufacturing plant in Berlin, Germany.
During events, 100 engineers set up a remote portable office by using a VPN to connect to the datacentre in the London office. The portable office is set up and torn down in approximately 20 different countries each year.
AIM runs Microsoft SQL Server in an on-premises virtual machine (VM).
Required:
• Migration of the database to Azure SQL Database
• Synchronize users from Active Directory to Azure Active Directory (Azure AD)
• Configure Azure SQL Database to use an Azure AD user as administrator
Which of the following should be configured?

Select 1 option(s):

For each Azure SQL Database, set the Active Directory administrator role.

For each Azure SQL Database server, set the Active Directory to administrator.

For each Azure SQL Database server, set the Access Control to administrator.

For each Azure SQL Database, set the Access Control to administrator.

Q18. Scenario: O’Shaughnessy’s is a fast food restaurant chain with stores nationwide, rivalled by Big Belly Burgers. You have been hired by the company to advise on migrating from an on-premises datacentre to Azure.

The IT team is working on a project to implement a lambda architecture on Microsoft Azure using an open-source big data solution for the purpose of aggregating, processing and maintaining data. During testing it is noted that the analytical data store is performing below expectations and management has come to you with the following requirement specifications.
Requirements:
• The solution must provide data warehousing
• The solution must reduce ongoing management activities
• The solution must deliver SQL query responses under one second
• The solution must create an HDInsight cluster that fulfills all the listed requirements
As the expert consultant, the IT team is looking to you for direction. Which type of cluster should you advise them to create?

Select 1 option(s):

Interactive Query

Apache HBase

Apache Spark

Apache Hadoop

Q19. Identify the missing word(s) in the following sentence within the context of Microsoft Azure.

Azure Synapse Analytics can act as the one-stop shop to meet all of your analytical needs in an integrated environment if you do not have an analytical environment in place already.
[?] is a single web UI that allows you to:
• Explore your data estate.
• Develop T-SQL scripts and notebooks to interact with the analytical engines.
• Build data integration pipelines for managing data movement.
• Monitor the workloads within the service.
• Manage the components of the service.

Select 1 option(s):

Azure Pipelines

Azure Monitor

Azure DevOps

Azure Synapse Studio

Azure Designer

Azure Portal

Q20. Identify the missing word(s) in the following sentence within the context of Microsoft Azure.

[?] for Storage provides an extra layer of security intelligence that detects unusual and potentially harmful attempts to access or exploit storage accounts. This layer of protection allows you to address threats without being a security expert or managing security monitoring systems.
Security alerts are triggered when anomalies in activity occur. These security alerts are integrated with Azure Security Center, and are also sent via email to subscription administrators, with details of suspicious activity and recommendations on how to investigate and remediate threats.

Select 1 option(s):

Azure Defender

Azure Vault

Azure Armour

Azure RBAC

Azure Shield

Q21. Azure offers a service to detect anomalies in account activities. These anomalies generate security alerts which are integrated with Azure Security Center, and are also sent via email to subscription administrators, with details of suspicious activity and recommendations on how to investigate and remediate threats.

Which of the below is the name of this service?

Select 1 option(s):

Encryption in transit

Azure Storage Account Security Feature

Azure Defender for Storage

Azure Shield for Storage

Azure Armour for Storage

Q22. True or False: In Azure Data Factory, in order to debug pipelines or activities, it is necessary to publish your workflows. Pipelines or activities which are being tested may be confined to containers to isolate them from the production environment.

Select 1 option(s):

FALSE

TRUE

Q23. Scenario: Your organization must respond to data events in real time in a continuous time-bound stream. The company must monitor IoT devices combined with remote patient monitoring to dispatch life-critical services.

Which would be the best Azure product to use?

Select 1 option(s):

Azure Table Storage

Azure DataNow

Azure Cosmos DB

Azure Stream Analytics

Azure On-prem solution

Azure Synapse Analytics

Q24. Activities within Azure Data Factory define the actions that will be performed on the data. They fall into three categories:

• Data movement activities
• Data transformation activities
• Control activities
Pipelines in Data Factory are defined in JSON format as follows:
JSON
{
    "name": "PipelineName",
    "properties":
    {
        "description": "pipeline description",
        "activities":
        [
        ],
        "parameters": {
        }
    }
}
Which of the JSON properties are required? (Select all that apply)

Select 2 option(s):

name

description

parameters

activities
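
As a quick sanity check on the pipeline skeleton shown in Q24, the sketch below builds the same structure as a Python dictionary and round-trips it through the standard `json` module. The helper name `serialize_pipeline` is hypothetical and only illustrates the shape of the document; it does not call Azure Data Factory and makes no claim about which properties the service requires.

```python
import json

# The pipeline skeleton from Q24, written as a plain Python dictionary.
pipeline = {
    "name": "PipelineName",
    "properties": {
        "description": "pipeline description",
        "activities": [],
        "parameters": {},
    },
}


def serialize_pipeline(definition):
    """Hypothetical helper: render a pipeline definition as indented JSON text."""
    return json.dumps(definition, indent=4)


text = serialize_pipeline(pipeline)

# Parsing the text back confirms it is well-formed JSON (straight quotes,
# balanced braces and brackets).
parsed = json.loads(text)
print(sorted(parsed["properties"].keys()))
```

Typographic quotes such as “name” are not valid JSON delimiters, so a definition pasted from a formatted document would fail at the `json.loads` step until the quotes are replaced with straight ones.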

Q25. During the process of creating a notebook, you need to specify the pool that needs to be attached to the notebook, that is, a SQL or Spark pool.

True or False: Notebook cells are individual blocks of code or text that run as a group. If you want to skip cells within the group, a simple skip notation in the cell is all that is required.

Select 1 option(s):

TRUE

FALSE

Q26. Which of the following services allow customers to store semi-structured datasets in Azure?

Select 2 option(s):

Azure Cosmos DB

Azure Table Storage

Azure File Storage

Azure SQL Database

Azure Content Delivery Network (CDN)

Azure SQL for VM

Q27. Azure Synapse SQL pools support placing complex data processing logic into stored procedures.

True or False: Multiple users and client programs can perform operations on underlying database objects through a procedure, even if the users and programs do not have direct permissions on those underlying objects.

Select 1 option(s):

FALSE

TRUE

Q28. As great as data lakes are at inexpensively storing our raw data, they also bring with them performance challenges:

• Too many small or very big files – more time opening & closing files rather than reading contents (worse with streaming).
• Partitioning, also known as “poor man’s indexing”, breaks down if you pick the wrong fields or when the data has many dimensions or high-cardinality columns.
• No caching – cloud storage throughput is low (cloud object storage is 20-50MB/s/core vs 300MB/s/core for local SSDs).
As a solution to the challenges with Data Lakes noted above, Delta Lake is a file format that can help you build a data lake comprised of one or many tables in Delta Lake format. Delta Lake integrates tightly with Apache Spark, and uses an open format that is based on Parquet.
Two of the core features of Delta Lake are performing UPSERTS and Time Travel operations.
What does the Time Travel operation do? (Select all that apply)

Select 4 option(s):

Writing complex temporal queries.

Re-creating analyses, reports, or outputs (for example, the output of a machine learning model). This could be useful for debugging or auditing, especially in regulated industries.

Providing snapshot isolation for a set of queries for fast changing tables.

Because Delta Lake is version controlled, you have the option to query past versions of the data using a single file storage system.
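
To put the throughput figures quoted in Q28 in perspective, the following sketch works out how long a single core would need to read a fixed amount of data at each rate. The 1,024 MB data size and the function name `scan_seconds` are assumptions chosen for illustration; the rates (20, 50, and 300 MB/s per core) are the ones given in the question text.

```python
def scan_seconds(size_mb, throughput_mb_per_s):
    """Time in seconds for one core to read size_mb at the given rate."""
    return size_mb / throughput_mb_per_s


SIZE_MB = 1024  # assumed example size, not taken from the question

rates = [
    ("cloud object storage, low end", 20),
    ("cloud object storage, high end", 50),
    ("local SSD", 300),
]

for label, rate in rates:
    print(f"{label}: {scan_seconds(SIZE_MB, rate):.1f} s")
```

This is simple division and ignores per-file open and close overhead, which the first bullet in the question identifies as a separate cost.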

Q29. Scenario: A teammate is working on a solution for transferring data between a dedicated SQL Pool and a serverless Apache Spark Pool using the Azure Synapse Apache Spark Pool to Synapse SQL connector.

When could SQL Auth be used for this connection?

Select 1 option(s):

Always, anytime you want to transfer data between the SQL and Spark Pool.

Never, it is not necessary to use SQL Auth when transferring data between a SQL or Spark Pool.

None of the listed options.

When you need token-based authentication to a dedicated SQL pool outside of the Synapse Analytics workspace.

Q30. Scenario: Your team has deployed a factory to production and realizes there’s a bug that needs to be fixed right away, but you can’t deploy the current collaboration branch.

What is the best action to take?

Select 1 option(s):

Deploy a hotfix

Utilize a workhole

None of the listed options

Deploy a timeshift

Create a rollback to a savepoint

Q31. Azure Data Factory provides a variety of methods for ingesting data, and also provides a range of methods to perform transformations.

These methods are:
• Mapping Data Flows
• Compute Resources
• SSIS Packages
Mapping Data Flows provides a number of different transformation types that enable you to modify data. They are broken down into the following categories:
• Schema modifier transformations
• Row modifier transformations
• Multiple inputs/outputs transformations
Which of the following are valid transformations available in the Mapping Data Flow? (Select all that apply)

Select 6 option(s):

Lookup

Aggregate

Alter row

Exists

Filter

Union

Q32. Scenario: You have been contracted by Wayne Enterprises, a company owned by Bruce Wayne with a market value of over twenty-seven million dollars. Bruce founded Wayne Enterprises shortly after he created the Wayne Foundation, and he became the president and chairman of the company.

Bruce has come to you because his IT team needs advice on the configuration and synchronization of data between an on-premises Microsoft SQL Server database and Azure SQL Database.
Recently, ad-hoc and reporting queries have been overloading the on-premises production instance, and your expert advice is required on the following points.
Requirements:
• Execute an initial data synchronization to Azure SQL Database (minimize downtime)
• Execute bi-directional data synchronization after initial synchronization
A synchronization solution must be created and implemented and Bruce and the team look to you as the Azure expert. Which synchronization method should you advise the team to use?

Select 1 option(s):

Azure SQL Data Sync

Backup and restore

SQL Server Agent job

Transactional replication

Data Migration Assistant

Q33. Azure Storage accounts are the base storage type within Azure. Azure Storage offers a very scalable object store for data objects and file system services in the cloud. It can also provide a messaging store for reliable messaging, or it can act as a NoSQL store.

Which of the following are Azure Storage configuration options? (Select all that apply)

Select 4 option(s):

Azure Cosmos DB

Azure Database Server

Azure Queue

Azure Table

Azure Files

Azure Blob

Q34. The following are the facets of Azure Databricks security:

• Data Protection
• IAM/Auth
• Network
• Compliance
Which of the following comprise Data Protection within Azure Databricks security? (Select five)

Select 5 option(s):

VNet Injection

Vault Secrets

AAD

Managed Keys

ACLs

TLS

Q35. The first step in deploying Azure Synapse Analytics is to deploy an Azure Synapse Analytics workspace. A shared Hive-compatible metadata system allows tables defined on files in the data lake to be seamlessly consumed by either Spark or Hive.

SQL and Spark can directly explore and analyze which types of files stored in the data lake? (Select all that apply)

Select 4 option(s):

Parquet

CSV

TSV

XLSX

PDF

JSON

Q36. Which feature of Spark determines how your code is executed?

Select 1 option(s):

Java Garbage Collection

Tungsten Record Format

Catalyst Optimizer

Cluster Configuration

Q37. Which is the correct syntax for overwriting data in Azure Synapse Analytics from a Databricks notebook?

Select 1 option(s):

df.write.mode("overwrite").option("...").option("...").save()

df.write.format("com.databricks.spark.sqldw").update().option("...").option("...").save()

df.write.format("com.databricks.spark.sqldw").overwrite().option("...").option("...").save()

df.write.format("com.databricks.spark.sqldw").mode("overwrite").option("...").option("...").save()

Q38. Azure offers several types of storage for data; the one chosen should depend on the needs of the users. Each data store has a different price structure. When you want to store data but don’t need to query it, which would be the most cost-efficient choice?

Select 1 option(s):

Azure Stream Analytics

Azure Storage

Azure Databricks

Azure Data Catalog

Azure Data Factory

Azure Data Lake Storage

Q39. Identify the missing word(s) in the following sentence within the context of Microsoft Azure.

In Azure Data Factory, a(n) [?] is a logical grouping of activities that together perform a task.

Select 1 option(s):

Sink

Pipeline

Activity

Orchestration

Linked Service

Q40. Identify the missing word(s) in the following sentence within the context of Microsoft Azure.

A(n) [?] is an orchestration of pipeline activities that includes chaining activities in a sequence, branching, defining parameters at the pipeline level, and passing arguments while invoking the pipeline on demand or from a trigger.

Select 1 option(s):

Procedure

Control Flow

Activity

Test Lab

Data Flow

Workflow

Q41. Identify the missing word(s) in the following sentence within the context of Microsoft Azure.

When working with large data sets, it can take a long time to run the sort of queries that clients need. These queries can’t be performed in real time, and often require algorithms such as MapReduce that operate in parallel across the entire data set. The results are then stored separately from the raw data and used for querying.
One drawback to this approach is that it introduces latency. If processing takes a few hours, a query may return results that are several hours old. Ideally, you would like to get some results in real time (perhaps with some loss of accuracy), and combine these results with the results from the batch analytics.
The Lambda architecture is a big data processing architecture that addresses this problem by combining both batch and real-time processing methods. It features an append-only immutable data source that serves as a system of record. Timestamped events are appended to existing events (nothing is overwritten). Data is implicitly ordered by time of arrival.
The [?] is a vast improvement upon the traditional Lambda architecture. At each stage, we enrich our data through a unified pipeline that allows us to combine batch and streaming workflows through a shared filestore with ACID-compliant transactions.

Select 1 option(s):

Anaconda architecture

No-SQL architecture

Data Lake architecture

Data Sea architecture

Serverless architecture

Delta Lake architecture
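
The traditional Lambda architecture described in Q41 can be sketched in a few lines of plain Python. This is a toy model under stated assumptions, not a production design: the function names (`append_event`, `batch_view`, `speed_view`, `query`) and the integer timestamps are hypothetical, and the batch view is recomputed on every call, whereas a real system would precompute it and let it lag behind.

```python
from collections import Counter

# Append-only, immutable event log that acts as the system of record.
event_log = []


def append_event(timestamp, key):
    """Append a timestamped event; existing events are never overwritten."""
    event_log.append((timestamp, key))


def batch_view(cutoff):
    """Batch layer: count all events that arrived before the cutoff."""
    return Counter(key for ts, key in event_log if ts < cutoff)


def speed_view(cutoff):
    """Speed layer: count only the recent events at or after the cutoff."""
    return Counter(key for ts, key in event_log if ts >= cutoff)


def query(cutoff):
    """Serving layer: merge the batch result with the real-time result."""
    return batch_view(cutoff) + speed_view(cutoff)


for ts, key in [(1, "a"), (2, "b"), (3, "a"), (4, "a")]:
    append_event(ts, key)

merged = query(cutoff=3)
print(dict(merged))
```

The two code paths (`batch_view` and `speed_view`) have to be kept consistent with each other, which is the maintenance burden that unified batch-and-streaming pipelines aim to remove.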

Q42. When talking about the Azure Databricks workspace, we refer to two different things.

• The first reference is the logical Azure Databricks environment in which clusters are created, data is stored (via DBFS), and in which the server resources are housed.
• The second reference is the more common one used within the context of Azure Databricks.
The first step to using Azure Databricks is to create and deploy a Databricks workspace, which is the logical environment. You can do this in the Azure portal.
There are a number of required values to create your Azure Databricks workspace.
Which are they? (Select five)

Select 5 option(s):

Databricks Runtime Version

Subscription

Workspace Name

Resource Group

Pricing Tier

Location

Q43. Identify the missing word(s) in the following sentence within the context of Microsoft Azure.

[?] provides one-click setup, streamlined workflows, an interactive workspace for Spark-based applications plus it adds capabilities to Apache Spark, including fully managed Spark clusters and an interactive workspace.

Select 1 option(s):

Azure Cosmos DB

Azure Data Lake Storage

Azure Databricks

Azure Data Catalog

Azure SQL Data Warehouse

Azure Storage Explorer

Q44. Microsoft Azure Storage is a managed service that provides durable, secure, and scalable storage in the cloud.

Azure Files enables you to set up highly available network file shares that can be accessed using the standard Server Message Block (SMB) protocol. This means that multiple VMs can share the same files with both read and write access. You can read the files using the REST interface or the storage client libraries. You can also associate a unique URL to any file to allow fine-grained access to a private file for a set period of time.
Which are common scenarios where File shares can be used? (Select all that apply)

Select 3 option(s):

Shared data between on-premises applications and Azure VMs to allow migration of apps to the cloud instantly.

Shared data between on-premises applications and Azure VMs to allow migration of apps to the cloud over a period of time.

Storing shared configuration files for VMs, tools, or utilities so that everyone is using unique versions.

Storing shared configuration files for VMs, tools, or utilities so that everyone is using the same version.

Log files such as diagnostics, metrics, and crash dumps.

Q45. Scenario: A customer of Ultron Electronics is attempting to use a $300 store credit for the full amount of a new purchase. They are trying to double-spend their credit by creating two transactions at the exact same time using the entire store credit. The customer is making two transactions using two different devices.

The database behind the scenes is an ACID-compliant transactional database.
What would be the result?

Select 1 option(s):

None of the listed options.

One order would be processed and use the in-store credit, and the other order would update the remaining inventory for the items in the basket, but would not complete the order.

Both orders would be processed and use the in-store credit.

One order would be processed and use the in-store credit, and the other order would not be processed.

Q46. Synapse Studio comes with an integrated notebook experience. Notebooks in Synapse Studio are a web interface that enables you to create, edit, or transform data in files. They are based on a live code experience, including visualizations and narrative text.

True or False: You can access data in the primary storage account directly. There’s no need to provide the secret keys.

Select 1 option(s):

TRUE

FALSE

Q47. Connectors are Azure Data Factory objects that enable your Linked Services and Datasets to connect to a wide variety of data sources and sinks. These can include connections to Azure resources and third-party connectors such as Amazon S3 or Google Cloud. Nearly 100 connectors are available.

Which are among the file formats supported? (Select six)

Select 6 option(s):

Avro format

JSON format

Delimited text format

ORC format

Binary format

Parquet format

Q48. Scenario: You are working on a project using Azure Synapse Studio and want to configure a private endpoint. You open Azure Synapse Studio, go to the Manage hub, and see that the private endpoints option is greyed out.

Why is the option not available?

Select 1 option(s):

A conditional access policy has to be defined first.

There are service interruptions which must be troubleshot first.

A managed virtual network has not been created.

Azure Synapse Studio does not support the creation of private endpoints.

Q49. Which DataFrame method do you use to create a temporary view?

Select 1 option(s):

tempViewCreate()

createTempView()

createTempViewDF()

createOrReplaceTempView()

Q50. To provide a better authoring experience, Azure Data Factory allows you to configure version control software for easier change tracking and collaboration. Which of the below does Azure Data Factory integrate with? (Select all that apply)

Select 1 option(s):

Launchpad

Google Cloud Source Repositories

Source Safe

BitBucket

Git repositories

Team Foundation Server

Q51. Scenario: Jungle.com uses Azure Cosmos DB to store sales orders and customer profile data from their eCommerce site. The NoSQL document store provided by the Azure Cosmos DB provides the familiarity of managing their data using SQL syntax, while being able to read and write the files at a massive, global scale.

While Jungle.com is happy with the capabilities and performance of Azure Cosmos DB, they are concerned about the cost of executing a large volume of complex analytical queries needed to fulfill their operational reporting requirements.
They want to efficiently access all their operational data stored in Cosmos DB without needing to increase the Azure Cosmos DB throughput and associated cost. They have looked at options for extracting data from their containers to the data lake as it changes, through the Azure Cosmos DB change feed mechanism.
The problem with this approach is the extra service and code dependencies and long-term maintenance of the solution. They could perform bulk exports from a Synapse Pipeline, but then they won’t have the most up-to-date information at any given moment.
Which would be the best action to take?

Select 1 option(s):

Enable Azure Synapse Link for Cosmos DB and enable the analytical store on their Azure Cosmos DB containers.

Enable Azure VNet Peering for Cosmos DB and enable the analytical store on their Azure Cosmos DB containers.

Enable Azure Dedicated Connect for Cosmos DB and enable the analytical store on their Azure Cosmos DB containers.

Enable Azure VPN Gateway for Cosmos DB and enable the analytical store on their Azure Cosmos DB containers.

Q52. Which Index Type offers the highest compression in Synapse Analytics?

Select 1 option(s):

Rowstore

Columnstore

Round-Robin

Heap

Replicated

Q53. Scenario: O’Shaughnessy’s is a fast food restaurant chain with stores nationwide, rivalled by Big Belly Burgers. You have been hired by the company to advise on migrating from an on-premises datacentre to Azure.

The IT team has an Azure subscription which contains an Azure Storage account and they plan to create an Azure container instance named OShaughnessy001 that will use a Docker image named Source001. Source001 contains a Microsoft SQL Server instance that requires persistent storage. Right now the team is configuring a storage service for OShaughnessy001 and there is debate around which of the following should be used.
As the expert consultant, the team looks to you for direction.
Which should you advise them to use?

Select 1 option(s):

Azure Table storage

Azure Queue storage

Azure Blob storage

Azure Files

Q54. One of the key management features at your disposal within Azure Synapse Analytics is the ability to scale the compute resources for SQL or Spark pools to meet the demands of processing your data. Compute is separate from storage, which enables you to scale compute independently of the data in your system. This means you can scale up and scale down the compute power to meet your needs.

Apache Spark pools for Azure Synapse Analytics use an Autoscale feature that automatically scales the number of nodes in a cluster instance up and down.
Autoscale continuously monitors the Spark instance and collects which of the following metrics? (Select five)

Select 5 option(s):

Total Pending Memory

Average Refresh rate

Total Free Memory

Total Free CPU

Used Memory per Node

Total Pending CPU

Q55. Azure Synapse Analytics is a high-performing Massively Parallel Processing (MPP) engine that is built with loading and querying large datasets in mind.

There are times though when performance expectations are not met, and it is necessary then to know what aspects of the table structures and architecture can be reviewed and adapted to maximize query performance.
What is the following code intended to accomplish?
PowerShell
Set-AzSqlDatabase -ResourceGroupName "resourcegroupname" -DatabaseName "mySampleDataWarehouse" -ServerName "sqlpoolservername" -RequestedServiceObjectiveName "DW300c"

Select 1 option(s):

Address the issue of poor response time.

Address the issue of poor query performance.

Address the issue of poor load performance.

Address the issue of low concurrency.

Q56. Identify the missing word(s) in the following sentence within the context of Microsoft Azure.

A(n) [?] schema may be defined at query time.

Select 1 option(s):

Hybrid data type

Unstructured data type

Azure Cosmos DB data type

Structured data type

Q57. Authentication is the process of validating credentials as you access resources in a digital infrastructure. It ensures that an individual or a service that wants to access a service in your environment can prove who they are. Azure Synapse Analytics provides several different methods for authentication.

Which are valid authentication methods in Azure Synapse Analytics? (Select all that apply)

Select 6 option(s):

SQL Authentication

Azure Key Vault

MFA

Azure Active Directory

SAS

Managed identity

Q58. Identify the missing word(s) in the following sentence within the context of Microsoft Azure.

Azure Synapse Analytics is an integrated analytics platform, which combines data warehousing, big data analytics, data integration, and visualization into a single environment. Azure Synapse Analytics empowers users of all abilities to gain access and quick insights across all of their data, enabling a whole new level of performance and scale.
Diagnostic analytics deals with answering the question [?].

Select 1 option(s):

"What is likely to happen in the future based on previous trends and patterns?"

"What is happening in my business?"

"Why is it happening?"

"When will the modification made meet my goals?"

Q59. Scenario: You are working at an online retailer and have been tasked with finding the average of sales transactions by storefront.

Which of the following aggregates would you use?

Select 1 option(s):

df.select(col("storefront")).avg("completedTransactions").groupBy(col("storefront"))

df.groupBy(col("storefront")).avg("completedTransactions")

df.groupBy(col("storefront")).avg(col("completedTransactions"))

df.select(col("storefront")).avg("completedTransactions")
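
For reference, the computation the scenario describes (an average of one column per group) can be sketched in plain Python, without Spark. This is only an illustration of the aggregate itself, not of the PySpark API. The helper name `average_by_key` and the sample rows are hypothetical, and only the column names `storefront` and `completedTransactions` come from the question.

```python
from collections import defaultdict


def average_by_key(rows, key, value):
    """Group dict rows by `key` and return the mean of `value` for each group."""
    # Per group, keep a running [sum, count] pair.
    totals = defaultdict(lambda: [0.0, 0])
    for row in rows:
        acc = totals[row[key]]
        acc[0] += row[value]
        acc[1] += 1
    return {group: total / count for group, (total, count) in totals.items()}


# Hypothetical sample data, invented for illustration only.
sales = [
    {"storefront": "north", "completedTransactions": 10},
    {"storefront": "north", "completedTransactions": 20},
    {"storefront": "south", "completedTransactions": 6},
]

averages = average_by_key(sales, "storefront", "completedTransactions")
print(averages)
```

The grouping step comes first and the average is then taken within each group, so the result has one entry per storefront.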

Q60. What does Azure Data Lake Storage (ADLS) Passthrough enable?

Select 1 option(s):

Blocking ADLS resources through a mount point when credential passthrough is enabled.

Commands running on a configured cluster can read and write data in ADLS without configuring service principal credentials.

User security groups that are added to ADLS are automatically created in the workspace as Databricks groups.

Automatically mounting ADLS accounts to the workspace that are added to the managed resource group.

Q61. True or False: Azure Synapse functionality requires integration with Azure Data Factory, Azure Databricks and Power BI.

Select 1 option(s):

FALSE

TRUE

Q62. Identify the missing word(s) in the following sentence within the context of Microsoft Azure.

As a Data Engineer, you can transfer and move data in several ways. The most common tool is Azure Data Factory which provides robust resources and nearly 100 enterprise connectors. Azure Data Factory also allows you to transform data by using a wide variety of languages.
Azure has opened the way for technologies that can handle unstructured data at an unlimited scale. This change has shifted the paradigm for loading and transforming data from [?].

Select 1 option(s):

ELT → ETL

ETL → ELT

ETL → RTO

MTD → RPO

RPO → RTO

ETL → MTD

Q63. Scenario: The organization you work at has data which is specific to a country or region due to regulatory control requirements.

When considering Azure Storage Accounts, which option meets the data diversity requirement?

Select 1 option(s):

None of the listed options.

Locate the organization’s data in a data centre with the strictest data regulations to ensure that regulatory requirement thresholds have been met. In this way, only one storage account will be required for managing all data, which will reduce data storage costs.

Enable virtual networks for the proprietary data and not for the public data. This will require separate storage accounts for the proprietary and public data.

Locate the organization’s data in a data centre in the required country or region, with one storage account for each location.

Q64. Identify the missing word(s) in the following sentence within the context of Microsoft Azure.

Many business application architectures separate transactional and analytical processing into separate systems, with data stored and processed on separate infrastructures. These infrastructures are commonly referred to as OLTP (online transaction processing) systems working with operational data, and OLAP (online analytical processing) systems working with historical data, with each system optimized for its specific task.
Azure Cosmos DB provides … [?]

Select 1 option(s):

A transactional store optimized for transactional workloads and a fully managed autosync process to keep the data within these stores in sync.

None of the listed options.

Both a transactional store optimized for transactional workloads and an analytical store optimized for analytical workloads and a fully managed autosync process to keep the data within these stores in sync.

An analytical store optimized for analytical workloads and a fully managed autosync process to keep the data within these stores in sync.

Q65. True or False: The Apache Spark history server can be used to debug and diagnose completed Spark applications only.

Select 1 option(s):

TRUE

FALSE