Databricks/Databricks Mcq Question Set 2 Sample Test,Sample questions

Question:
 Spark is best suited for ______ data.

1.Real-time

2.Virtual

3.Structured

4.All of the above


Question:
 What is the maximum size of graph DB that a Fixed Container in cosmos DB can store?

1.10GB

2.15GB

3.100GB

4.50GB


Question:
 When Spark runs in Cluster Mode, which of the following statements about nodes is correct ?

1.There is one single worker node that contains the Spark driver and all the executors.

2.The Spark Driver runs in a worker node inside the cluster.

3.There is always more than one worker node.

4.There are less executors than total number of worker nodes.


Question:
 Which of the following DataFrame commands will NOT generate a shuffle of data from each executor across the cluster?

1.df.map()

2.df.collect()

3. df.orderBy()

4.df.repartition()


Question:
 Which type of node is not supported by Azure Batch?

1. Linux nodes

2.Windows nodes

3.Dockers

4.None of these.


Question:
Choose the correct option regarding Azure Storage.

1.It is possible to have role based access control for Blob and Queue storage service of Azure.

2.Shared key authorization is also possible.

3.It is possible to specify a container and its blob public.

4. All of these.


Question:
Choose the correct option.

1.Azure Kubernetes is an open source platform.

2.etcd is used to maintain the state of Kubernetes cluster and configuration.

3.Both A and B.

4.Neither A nor B.


Question:
Choose the correct option.

1.It can integrate with Azure Active Directory.

2.Role based access control is possible in Azure Kubernetes.

3.Both A and B.

4.None of these.


Question:
Choose the correct option.

1.A task is a collection of jobs.

2.A job is a collection of tasks.

3.A job is a collection of compute nodes.

4.A task is a collection of compute nodes.


Question:
Choose the correct option.

1.Azure Batch is a non visual tool.

2. Azure Btach allows users to fully configure the nodes.

3.Azure Batch provides job scheduling and automatically scales and manages the VMs running these jobs.

4.All of these.


Question:
Choose the incorrect option regarding Zone redundant storage replication.

1.It can be used for blobs only

2.3 copies of data are created.

3.Copies of data must be created in the facilities of same region.

4.None of these


Question:
Choose the wrong statement regarding Azure Kubernetes.

1.Use of Azure Kubernetes demands a ver low minimum monthly charge.

2.It can integrate with Visual Studio Code.

3.It provides elastic scalability.

4.None of these.


Question:
Elastically scalable throughput and storage is possible in:

1.Azure Cosmos DB Graph API

2.Azure Cosmos DB SQL API

3.Both A and B

4.none of the above


Question:
For which of the following options, Azure Batch can be used?

1. Fluid Dynamics

2.Image processing.

3.Software test execution.

4.All of the above.


Question:
How data files stored in Azure blob storage are being accessed by Azure Batch?

1.Separate software needs to be installed.

2.Azure Batch already has built in support for accessing those files.

3.With the help of Compute nodes.

4. Azure Batch cannot access those files.


Question:
How many copies of data are created in case of geo redundant storage replication?

1.6

2.2

3.3

4.4


Question:
How many replication options are there while creating an Azure storage account?

1.2

2.5

3.8

4.4


Question:
If you wanted to:
1. Cache a df as SERIALIZED Java objects in the JVM and;
2. If the df does not fit in memory, store the partitions that don’t fit on disk, and read them from there when they’re needed;
3. Replicate each partition on two cluster nodes.
which command would you choose ?

1.df.persist(StorageLevel.MEMORY_ONLY)

2.df.persist(StorageLevel.MEMORY_AND_DISK_SER)

3.df.persist(StorageLevel.MEMORY_AND_DISK_2_SER)

4.df.cache(StorageLevel.MEMORY_AND_DISK_2_SER)


Question:
In how many ways Spark uses Hadoop?

1. 2

2.3

3.4

4.5


Question:
In which of the APIs of Azure Cosmos DB, is automatic indexing possible?

1.Graph API, SQL API, Table API

2.Graph API, SQL API

3.Graph API

4.SQL API, Table API


Question:
Point out the correct statement.

1.Spark enables Apache Hive users to run their unmodified queries much faster

2.Spark interoperates only with Hadoop

3.Spark is a popular data warehouse solution running on top of Hadoop

4.All of the above


Question:
Select the API from the following options that Azure Cosmos DB support?

1.Gremlin API

2. Apache Cassandra API

3.Mongo DB API

4.All of the above


Question:
The code below should return a new DataFrame with 50 percent of random records from DataFrame df without replacement.

1.df.sample(False, 0.5, 5)

2.df.random(False, 0.5, 5)

3.df.sample(False, 5, 25)

4.df.sample(False, 50, 5)


Question:
The DataFrame df includes a time string column named timestamp_1. Which is the correct syntax that creates a new DataFrame df1 that is just made by the time string field converted to a unix timestamp?

1.df1 = df.select(unix_timestamp(col(“timestamp_1″),”MM-dd-yyyy HH:mm:ss”).as(“timestamp_1”))

2.df1 = df.select(unix_timestamp(col(“timestamp_1″),”MM-dd-yyyy HH:mm:ss”, “America/Los Angeles”).alias(“timestamp_1”))

3.df1 = df.select(unix_timestamp(col(“timestamp_1″),”America/Los Angeles”).alias(“timestamp_1”))

4.df1 = df.select(unix_timestamp(col(“timestamp_1″),”MM-dd-yyyy HH:mm:ss”).alias(“timestamp_1”))


Question:
What are compute nodes in Azure Batch?

1.Applications

2.Job

3.Virtual Machines

4.Task


Question:
What can be the maximum size of a queue message?

1.256 KB

2.64 KB

3.128 KB

4.No maximum size is there.


Question:
What does Azure Batch provide by default for parallelization?

1.cluster

2.container

3.multiple nodes

4.None of these.


Question:
What is the basic operational unit of Kubernetes?

1.Pod

2.Container

3.Nodes

4.Task


Question:
What is the maximum latency limit for reads and writes in case of Azure Cosmos DB Table API?

1.100ms

2.10ms

3.50ms

4.5ms


Question:
When was Apache Spark developed ?

1.2007

2.2008

3. 2009

4. 2010


Question:
Which container is supported in Azure Cosmos DB?

1.Fixed container

2.Unlimited Container

3.Both Fixed container and Unlimited Container

4.Neither Fixed container nor Unlimited Container


Question:
Which of the following Azure services is used for performing high performance parallel computing jobs in the cloud?

1.Azure Batch Service.

2.Azure Kubernetes Service.

3.Azure Key Vault.

4.Azure App Services.


Question:
Which of the following DataFrame commands is a narrow transform?

1.df.drop()

2.df.collect()

3.df.orderBy()

4.df.repartition()


Question:
Which of the following DataFrame commands is a wide transform?

1.df.drop()

2.df.contains()

3.df.filter()

4.df.repartition()


Question:
Which of the following Features of Apache Spark?

1. Speed

2.Supports multiple languages

3.Advanced Analytics

4. All of the above


Question:
Which of the following is incorrect way for Spark deployment?

1.Standalone

2.Hadoop Yarn

3. Spark in MapReduce

4.Spark SQL


Question:
Which of the following is True regarding Azure Cosmos DB?

1. It supports relational data model.

2.It can be scaled horizontally.

3.The data can be distributed to fixed number of Azure regions.

4.Both a and b


Question:
Which one of the following can be considered as the primary data store of Kubernetes?

1.node

2.pod

3.VM scale sets.

4.etcd


Question:
Which one of the following can be done for a container based application using Azure Kubernetes?

1.Making container scalability easy.

2.Make workloads portable.

3.Build more extensible apps.

4.All of the above.


Question:
Which one of the following helps to set up cluster autoscaler for adding capacity as per demand?

1.Virtual nodes

2.VM Scale sets

3.Container

4.None of the above.


Question:
Which one of the following is a data model supported by Azure Cosmos DB?

1. key-value

2.graph

3.table

4.All of the above


Question:
Which one of the following is an orchestration software which can be used for scaling containers?

1. Azure Batch.

2.Azure Kubernetes.

3.Azure Data Factory.

4.Azure key vault.


Question:
Which one of the following is correct regarding clusters of Azure Kubernetes?

1.Cluster name need not be unique within the selected resource group.

2. Azure CLI can be used to create clusters.

3.Both A and B.

4. Neither A nor B.


Question:
Which one of the following is incorrect regarding Azure Batch?

1.It allows to run large-scale parallel workloads although the cost is high.

2.Auto scaling is possible whenever required.

3.By auto scaling, it means that it can provide more nodes if the number of queued tasks are more.

4.None of these.


Question:
Which one of the following is incorrect regarding Azure Kubernetes?

1.Azure Kubernetes does not mandatorily need resources to be created in cloud.

2.Azure Kubernetes manages and makes deployment of container based applications easy.

3.Azure Kubernetes helps in automatic scheduling of container based application.

4.None of these.


Question:
Which one of the following is most preferred for storing streaming videos and audios?

1. Azure Files

2. Azure Queues

3.Azure Blobs

4.Azure Tables


Question:
Which one of the following is not correct regarding Azure storage?

1.Datas are highly available.

2.Storing datas in Azure Storage is secured.

3.No redundant data

4.None of these.


Question:
Which one of the following is the data service provided by Azure Storage platform?

1.Azure Blobs

2.Azure Tables

3.Azure Queues

4.All of these.


Question:
Which one of the following is the feature of Azure Cosmos DB Graph API?

1.Automatic indexing

2.Fully managed

3.Multi-region replication

4.All of the above


Question:
Which one of the following logical partitions a single container in cosmos DB cannot have?

1.PHYSICAL

2.SALES

3.HR

4.Product backlog


Question:
Which one of the following provides block level storage volumes for Azure VMs?

1.Azure Disks

2.Azure Blobs

3.Azure Queues

4.Azure Tables


Question:
While creating Azure Storage account, which replication option is the cheapest one?

1. Zone redundant storage

2.Locally redundant storage

3.Geo redundant storage

4.Read access geo redundant storage


Question:
With which of the following can Azure Batch integrate for fetching data?

1.Azure Blob Storage only

2.Azure Data Lake storage only

3.Both Azure Blob Storage and Azure Data Lake Storage.

4.Neither Azure Blob Storage nor Azure Data Lake Storage.


Question:
_____ is a distributed graph processing framework on top of Spark.

1.MLlib

2.Spark Streaming

3. GraphX

4.none of the above


More MCQS

  1. Databricks Mcq Question Set 1
  2. Databricks Mcq Question Set 2
Search
R4R Team
R4Rin Top Tutorials are Core Java,Hibernate ,Spring,Sturts.The content on R4R.in website is done by expert team not only with the help of books but along with the strong professional knowledge in all context like coding,designing, marketing,etc!