Snowflake MCQs

Download as pdf or txt
Download as pdf or txt
You are on page 1of 15
At a glance
Powered by AI
The key takeaways are that Snowflake's architecture is separated into three main layers - the cloud services layer, the query processing layer, and the database storage layer. The cloud services layer provides centralized services like security, the query processing layer contains virtual warehouses for data access, and the database storage layer stores data in a compressed columnar format. Some other takeaways are around Snowflake's support for data loading, security features, and data modeling techniques.

Snowflake's architecture consists of three main layers - the cloud services layer provides services like security, metadata, and transaction management. The query processing layer contains virtual warehouses that users connect to for querying data. The database storage layer stores data in a compressed columnar format divided into immutable micro-partitions.

Some best practices for loading data include splitting large files into smaller 10-100MB files, splitting files by line to avoid records spanning chunks, and using COPY in parallel up to the number of files. The number of micro-partitions is determined by Snowflake, not the user, and subsequent changes create new micro-partitions.

True or False: Data Storage is independent from compute?

True
True or False: All virtual warehouses have access to all data?
True
True or False: Al interactions with data are initialized through the services layer?
True
True or False: Multi-Cluster Warehouses support high concurrency?
True
True or False: Warehouses can be dynamically expanded to adjust to workloads?
True
Which layer provides centralized services for the data warehouse?
Cloud Services
Which layer contains virtual warehouses for data access?
Query Processing
Which layer contains the data in compressed, columnar format?
Database Storage
True or False: Snowflake's security and authentication includes Multi-Factor
authentication?
True
True or False: Snowflake's security and authentication includes Snowflake Failures
alerts?
False
True or False: Snowflake's security and authentication includes data encryption
using Snowflake-managed keys?
False
True or False: Snowflake's security and authentication includes object-level access?
True
True or False: Snowflake's architecture includes advance capabilities in the cloud
services layer that delivers transaction management for consistent operations on the
same data at the same time?
True
True or False: Snowflake's architecture includes advance capabilities in the cloud
services layer that delivers metadata service?
True
True or False: Snowflake's architecture includes advance capabilities in the cloud
services layer that delivers security and authentication control?
True
True or False: Snowflake's architecture includes advance capabilities in the cloud
services layer that delivers query optimization?
True
True or False: Snowflake supports landing data into internal stage on the cloud
storage platform?
True
True or False: Snowflake supports landing data into external stage on the cloud
storage platform?
True
True or False: Snowflake supports landing data into bring your own device?
False
True or False: Compute resources used by Snowflake for data loading jobs can by
provide by user managed virtual warehouse?
True
True or False: Compute resources used by Snowflake for data loading jobs can by
provide by Snowflake managed service?
True
True or False: Compute resources used by Snowflake for data loading jobs can by
provide by hardware provisioned by user directly from cloud providers?
False
True or False: Stages are unique database objects in Snowflake?
True
True or False: Pipe are unique database objects in Snowflake?
True
True or False: Table are unique database objects in Snowflake?
False
Which approach would result in improved performance through linear scaling of data
ingestion workload:

A) Resize virtual warehouse


B) Consider the practice of organizing data by granular path
C) Consider the practice of splitting input file batch within the recommended range of
10MB and 100MB?
D) All of the above
D) All of the above
True or False: Snowflake Support Services addresses customer issues covering
troubleshooting failed queries?
True
True or False: Snowflake Support Services addresses customer issues covering
product usage questions?
True
True or False: Snowflake Support Services addresses customer issues covering
individual query syntax improvement?
True
True or False: Snowflake Support Services addresses customer issues covering 3rd
party application configuration support?
False
Snowflake users with support contracts that have a Severity-1 issue should contact
Snowflake in these ways except:

A) Snowflake Lodge - set appropriate severity (1-4)


B) Send email
C) 844-SNOWFLAKE
B) Send email
True or False: The Snowflake Lodge is a community site that has technical
information for Support Customers only?
False
True or False: The Snowflake Lodge is a community site that is the recommend
place to submit support cases?
True
True or False: The Snowflake Lodge is a community site that contains the most up to
date security alerts and product release information?
True
True or False: The Snowflake Lodge is a community site that does not allow
members to post questions?
False
Snowflake includes administration settings for resource consumption in order to:

A) Help control costs associated with unexpected credit usage of warehouses


B) Manage access to data warehouse for specified users
C) Maintain availability
A) Help control costs associated with unexpected credit usage of warehouses
True or False: Create Role can be granted within a Snowflake account by the
administrator?
True
True or False: Drop Role can be granted within a Snowflake account by the
administrator?
True
True or False: Create User permission can be granted within a Snowflake account
by the administrator?
True
True or False: Drop User permission can be granted within a Snowflake account by
the administrator?
True
True or False: Grant Privilege permission can be granted within a Snowflake account
by the administrator?
True
Snowflake provides specific administration features and capabilities to support the
following activities except:

A) Managing databases and warehouses within a Snowflake account


B) Managing roles and users within a Snowflake account
C) Monitoring Usage and manage resources to control costs in a Snowflake account
D) Manage 3rd party applications providing data to a Snowflake account
D) Manage 3rd party applications providing data to a Snowflake account
A defined data set is a point in time snapshot and can be updated by customers is?
Snowflake's Cloning
A defined data set can be shared with customers outside the Snowflake account is?
Snowflakes's Data Sharing
Query statement encryption is supported on ___________ accounts?

A) Standard
B) Enterprise
C) Enterprise for Sensitive (ESD)
D) Virtual Private Snowflake (VPS)
C) Enterprise for Sensitive (ESD)
True or False: Caching techniques are supported by Snowflake's performance
optimizing query methods?
True
True or False: B-tree type indexes are supported by Snowflake's performance
optimizing query methods?
False
True or False: Retrieving results of previous query from cache is supported by
Snowflake's performance optimizing query methods?
True
True or False: Snowflake security supports user-based access control?
True
True or False: Federated authentication in Snowflake is complaint with SAML 2.0?
True
True or False: Tri-secret requires that customers manage their own keys?
True
All security information is stored in the ___________ layer in the Snowflake
architecture?
A) Storage
B) Compute
C) Service
D) All of the above
C) Service
True or False: One benefit of client-side encryption is that it provides a secure
system for managing data in cloud storage?
True
True or False: One benefit of client-side encryption is that the data is encrypted
before loading into storage layer?
False
True or False: One benefit of client-side encryption is the storage service layer only
contains encrypted version of the data?
True
True or False: One benefit of client-side encryption is that queries can be encrypted
on the client side?
False
True or False: MFA (Multi-factor Authentication) is automatically enabled for your
account and available for all users to self-enroll?
True
True or False: MFA (Multi-factor Authentication) is an integrated feature powered by
Duo Security service?
True
True or False: MFA (Multi-factor Authentication) can be used for connecting to
Snowflake via the Snowflake JDBC driver?
True
True or False: MFA (Multi-factor Authentication) login is desired only for connecting
to Snowflake through the web interface?
False
Snowflake includes Role-Based Access Control to enable administrators to:

A) Limit access to data and privileges


B) Manage secure access to the Snowflake account and data
C) Establish role hierarchy and privilege inheritance to align access
D) All of the above
D) All of the above
With an IdP (identity provider) configured for your account, Snowflake supports using
SSO to connect and authenticate with ODBC Driver?
True
With an IdP (identity provider) configured for your account, Snowflake supports using
SSO to connect and authenticate with Python Connector?
True
With an IdP (identity provider) configured for your account, Snowflake supports using
SSO to connect and authenticate with JDBC Driver?
True
With an IdP (identity provider) configured for your account, Snowflake supports using
SSO to connect and authenticate with SnowSQL?
...
Choosing separate accounts in Snowflake enables users to have:

A) Different editions of Snowflake and different regions


B) Billing at the account level
C) Simpler database object deployment between environments
D) All of the above
D) All of the above
True or False: Different editions of Snowflake instances require separate accounts?
True
True or False: Snowflake instances in different regions require separate accounts?
True
True or False: Data can be shared READ ONLY across accounts, and can be cloned
accounts?
False
An enterprise view of data is useful because:

A) A data set can be stored once and shared multiple times


B) Data set provisioning is assigned to the owner of the data set
C) Data sets can be shared securely
D) All of the above
D) All of the above
True or False: Data Sharing is only supported between accounts in the same
Snowflake region?
True
True or False: A share can't be cloned by a consumer account, but the share data
CAN be copied into a table?
True
True or False: Data can be shared Read-Only across Snowflake accounts and can
also be cloned?
False
True or False: There are cases where separate accounts are required such as
different editions or regions?
True
Since Snowflake has several advantages over OnPrem data warehouse databases,
which of the following list are differentiators for Snowflake:

A) Single place for data, both structured and semi-structured


B) Minimal management
C) Instant and live data sharing
D) Pay a set monthly fee for DWaaS and support
E) Instant and unlimited on-demand scalability in both Storage and Compute
F) All of the above
A, B, C, & E
Select all layers that are part of Snowflake architecture:

A) Security
B) Compute
C) Memory
D) Storage
E) Query Optimization
F) Transaction Management
G) Cloud Services
H) Metadata Management
B, D, & G
Select all characteristics of Snowflake's Multi-Cluster environment:

A) Multiple virtual warehouses in a deployment


B) User has to specify which cluster each query will utilize
C) Individual warehouses automatically scale up and down base on query activity
D) Multi-cluster warehouses support all the same properties and actions as single-
cluster warehouse
E) All of the above
A, C, and D
Select the two types of modes that a Multi-Cluster Warehouses:

A) Statically
B) Dynamically
C) Maximized
D) Auto-Scale
F) None of the above
C&D
To help control the usage of credits in Auto-scale mode, Snowflake provides a
property, _________________, that determines the scaling policy to use when
automatically starting or shutting down additional clusters. Select the property:

A) Auto_Scale
B) Scaling_policy
C) Maximum_number_of_server_clusters
D) Minimum_number of server clusters
E) None of the above
B) Scaling_Policy
Standard Scaling Policy
Definition:
Prevents/minimizes queuing by favoring starting additional clusters over conserving
credits.

Cluster Starts....
Immediately when either a query is queued or the system detects that there's one
more query than the currently-running clusters can execute.
Cluster Shuts down....
After 2 to 3 consecutive successful checks (performed at 1 minute intervals), which
determine whether the load on the least-loaded cluster could be redistributed to the
other clusters without spinning up the cluster again.
Economy Scaling Policy
Definition:
Conserves credits by favoring keeping running clusters fully-loaded rather than
starting additional clusters, which may result in queries being queued and taking
longer to complete. (default setting)

Cluster Starts....
Only if the system estimates there's enough query load to keep the cluster busy for
at least 6 minutes.

Cluster Shuts down....


After 5 to 6 consecutive successful checks (performed at 1 minute intervals), which
determine whether the load on the least-loaded cluster could be redistributed to the
other clusters without spinning up the cluster again.
Semi-structured data strings are stored in a column with a Snowflake data type of
__________________?

A) Object
B) Character
C) Varchar
D) Variant
E) None of the above
D) Variant
Not A) Object - key pair values like Hadoop
When sizing a Snowflake warehouse, which of the following factors should not be
considered:

A) Number of users
B) Number of concurrent queries
C) Number of tables being queried
D) Data size and composition
E) All of the above
A) Number of Users - if users are not querying the database, then they don't take
any resources
Snowflake utilizes per _______________ billing.

A) Millisecond
B) Second
C) Minute
D) Hour
E) None of the above
B) Second
True or False: Each server in a cluster has a position. Servers are always removed
from the warehouse in reverse order of when they were added (aka LIFO, "Last In,
First Out")
True
True or False: The size of the cache is determined by the number of servers in the
all of warehouses for an account
False - size of the cache is determined by the number of servers in a warehouse.
Snowflake has three types of caching to optimize performance. Select the three
types of caches from the list:

A) Server
B) Warehouse
C) Client
D) Results
E) Security
F) Metadata
G) Database
H) User
B, D, and F
This type of cache lives on the Compute instance?

A) Server
B) Results
C) Metadata
D) Warehouse
D) Warehouse cache
This type of cache lives on the Cloud Services layer?

A) Server
B) Results
C) Metadata
D) Warehouse
C) Metadata
True or False: Warehousing cache stores the results of the queries that have been
executed for 24 hours unless the underlying data changes, at which point the entry is
invalidated?
False - Results Cache
True or False: Metadata cache is used to optimize queries and improve query
compile time?
True
True or False: Warehouse cache may be reset or invalidated if the Virtual
Warehouse is suspended and resumed?
True
True or False: Semi-structured data can be queried using SQL while incorporating
JSON path notation?
True
True or False: In general, you should try to match the size of the warehouse to the
expected size and complexity of the queries to be processed by the warehouse?
True
SQL functionality can be extended via (select all of the appropriate answers):

A) SQL User Functions (UDF)


B) Javascript UDFs
C) Session Variables
D) Only SQL and Javascript UDFs
E) None of
A, B, and C
True or False: A customer needs to have a preexisting account with AWS or
MicroSoft Azure in order to have a Snowflake account or to use Snowflake?
False - not a requirement
How many availability zones does Snowflake replicate to for disaster recovery and
high availability?

A) Zero
B) One
C) Three
D) As many as the user specifies in the configuration
C) Three - Snowflake automatically does this for each account. It is built-in and fully
managed by Snowflake
True or False: Snowflake only replicates Storage layer to the other availability zones
within a region?
False - Storage and Cloud Services layers are replicated
True or False: Multi-region accounts are supported by Snowflake?
False - Each Snowflake accounts is located in a single region
Snowflake is currently available on the following Cloud Providers:

A) AWS
B) Google
C) Microsoft's Azure
G) All of the above
A & C - Google is not support, but is coming in the future
True or False: Snowflake automatically partitions the data so that the user does not
need to define partition scheme?
True
Which of the following is not a characteristic of "automatic micro-partitions" in
Snowflake?

A) New partitions are created on logical properties


B) Avoids skews between partitions
C) Partitioning is automatically completed on the natural ingestion order of the data
D) The natural ingestion order maintains correlations between columns which could
be useful for pruning
A) New partitions are created on logical properties

Snowflake uses physical properties. The partitions can overlap in ranges.


True or False: Snowflake optimizes the storage for semi-structured data based on
the repeating elements within the semi-structured strings?
True
True or False: Snowflake store structure and semi-structure data in different
proprietary file formats?
False - they use the same proprietary file format
Snowflake has two key features in their storage architecture. They are (select two):

A) Time Travel
B) Time Machine
C) Time Warp
D) Snapshot Cloning
E) Zero-Copy Cloning
F) Replication
A&E
Select the best description of Zero-Copy Cloning:

A) Metadata-only operation
B) No replication of data
C) Unmodified data stored once; modified data stored as new blocks
D) All of the above
D) All of the above
Which is not a characteristic of Time Travel?

A) Protection from accidental data operations


B) Recover data with the cost of running backups
C) Previous versions of data automatically retained
D) Retention period selected by customers (up to 90 days for Enterprise edition)
B) Recover data with the cost of running backups

There is no cost in Snowflake for running backups, purchasing additional hardware,


incurring downtime and overhead of additional administration
True or False: When defining columns to contain dates or timestamps, Snowflake
recommend choosing a date or timestamp data type rather than a character data
type?
False - Snowflake stores DATE and TIMESTAMP data more efficiently than
VARCHAR, resulting in better query performance.
True or False: Referential integrity constraints in Snowflake are enforced?
False - Referential integrity constraints in Snowflake are informational are not
enforced.
True or False: Since integrity constraints are not enforced, the developers should not
the creation of primary and foreign keys.
False - The primary keys and foreign keys enable members of your project team to
orient themselves to the schema design and familiarize themselves with how the
tables relate with one another. And most business intelligence (BI) and visualization
tools import the foreign key definitions with the tables and build the proper join
conditions. This approach saves you time and is potentially less prone to error than
someone later having to guess how to join the tables and then manually configuring
the tool.
True or False: Reclustering a small table typically doesn't improve query
performance significantly?
True
When might you might consider specifying a clustering key and reclustering the
table?

A) When a small is get out of sequence with the clustering key


B) The order in which the data is loaded does not match the dimension by which it is
most commonly queried
C) To reorder a table to match another clustering key in another table that the users
perform frequent joins
D) All of the above
E) None of the above
B) The order in which the data is loaded does not match the dimension by which it is
most commonly queried
True or False: Zero-Copy cloning allow a customer to provision real, Production data
for development and test environments without physically copying the data?
True
True or False: Snowflake storage cost are based only on a daily average of all
database compressed data storage?
False - Database data and data stored for Time Travel retention and failsafe.
True or False: A Snowflake Role is the only thing that connects Storage (Database)
to Compute (Virtual Warehouse)?
True
True or False: Snowflake provides embedded multi-factor authentication across
some of its editions?
False - multi-factor authentication is embedded in all editions
Which security features are provided as part of Enterprise editions (select all that
apply)?

A) Snowflake data encrypted using AES 256


B) Support for encrypting data using customer-managed keys
C) Periodic rekeying of encrypted data
D) Support for HIPAA
D) Support for PCI DSS compliance
F) Support for user SSO through federated authentication
A, C, and F
Data Sharing Providers will have to:

A) Incur the cost of the data storage - normal rates apply


B) May share a data set with an unlimited number of accounts
C) May set up and manage "Reader Accounts" for consumers who are not already
Snowflake customers
D) All of the above
D) All of the above
Data Sharing Consumers will have to:

A) Incur the cost of Compute (virtual warehouse)


B) Must create a database based on the share using the ACCOUNTADMIN role
C) Can query shared objects in the same query that they query their own objects
D) All of the above
D) All of the above
All statements are true about Data (Storage) except:

A) All data in Snowflake is maintained in databases


B) Each database consists of one or more schemas , and, within these schemas,
one or more tables and/or views
C) Schemas can be thought of as a physical grouping of database objects
D) Snowflake does not place any hard limits on the number of databases, schemas
(within a database), or objects (within a schema) you can create
C) Schemas can be thought of as a physical grouping of database objects

Logical grouping not physical grouping


True or False: Virtual Warehouse can be configured to auto-suspend after a specific
period of inactivity, or auto-resume as soon as the administrator cancels the auto-
suspend?
False - Auto-Resume restarts the virtual warehouse as soon as a user requests an
operation that requires Compute resources.
Upon startup or auto-resume of a Virtual Warehouse, how much time is
automatically billing to the account:

A) One minute
B) Two minutes
C) Five minutes
D) Accounts are always billed by the second
E) None of the above
A) One minute
After the one minute - accounts are billed by the second
True or False: Virtual Warehouse in Snowflake is where the data is stored?
False - Virtual Warehouses is the Compute operation
Snowflake supports data in VARIANTs up to a maximum size of:

A) 4 MB uncompressed
B) 8 MB uncompressed
C) 16 MB compressed
D) 32 MB compressed
E) None of the above
C) 16 MB compressed
Non-native values such as dates and timestamps are stored as strings when loaded
into a VARIANT column so which statements are true:

A) Operations on these values could be slower


B) These columns would consume more space than when stored in a relational
column with the corresponding data type
C) Both A and B are true
D) Neither A or B are true
C) Both A and B are true
The Snowflake UI is divided into for basic areas. Which of the following areas is not
part of the UI:

A) Query
B) Databases
C) Warehouse
D) History
A) Query
Worksheet is the name of the query section
True or False: The PUT and GET commands can be executed via the Snowflake UI?
False - These commands can only be executed using SNOWSQL client. They are
not supported by ODBC driver.
The commands for loading data into Snowflake are:

A) COPY & PUT


B) COPY & INSERT
C) INSERT & PUT
D) None of the above
B) COPY & INSERT
True or False: COPY statement allows insert on SELECT against a staged file, and a
WHERE clause can be used?
False - INSERT command allows the Where clause, not the COPY command
True or False: The COPY command is more performant than the INSERT
statement?
Trie
To load data into Snowflake, what needs to be in place (check all that apply)?

A) Virtual Warehouse
B) Predefined target table
C) Staging location with data staged
D) File Format
A, B, C, and D
True or False: Snowpipe is a continuous data ingestion service that detects and
loads streaming data?
True
Which of the following is not a feature of Snowpipe:

A) The service can load data from any internal or external stage
B) Snowpipe has a server-less compute model
C) The service provides REST endpoints and uses Snowflake provided compute
resources to load the data and retrieve history reports
D) Snowpipe loads data after it is stage and the user executes the LOADDATA
command
D) Snowpipe loads data after it is stage and the user executes the LOADDATA
command

No LOADDATA command - and the data is automatically loaded within minutes after
the files are added to the stage
Name all of the file/data types that Snowflake support for data loading?
Text Delimited (CSV, TAB, etc.)
JSON
XML
Avro,
Parquet
ORC
Which layer does Snowflake store the various statistics for databases, tables,
columns, and files?

A) Storage
B) Compute
C) Cloud Services
D) Snowflake does not store statistics
C) Cloud Services - the metadata is stored in this layer
True or False: Can a query run against Result Cache with no Virtual Warehouse
running and retrieve results (assuming the query is cached)?
True
______________ is used to analyze the execution details of a query?

A) Statistics
B) Metadata
C) Query Plan
D) Query Profile
D) Query Profile
True or False: A query plan can only be used for completed queries?
False - it can be used for both in-progress and completed queries
Which feature does not come with the Query Profile?

A) Graphical representation of the main components of the processing plan for the
query
B) Details and statistics for the overall query
C) Hints for improving the query performance
D) Statistics for each component of the query
C) Hints for improving the query performance
Which proven data modeling techniques does Snowflake support (check all that
apply)?

A) 3rd Normal Form


B) Data Vault
C) Star Schema
D) Snowflake Schema
E) All of the above
E) All of the above
True or False: The use case and audience drives the selection of the proven data
modeling techniques?
True
True or False: Snowflake enforces all constraints?
False - Snowflake only enforces NOT NULL constraint
True or False: A best practice of load and store Semi-structured data in Snowflake is
to parse the semi-structure string into structured columns on source data load?
False - the data should be loaded and stored in a VARIANT data type.
What is the best practice for handling semi-structured data with 3rd party BI tools?

A) Use the BI tool to create a metadata object to view the column


B) ETL the column with other columns that are part of the query into a structured
table
C) Create a Snowflake view that parse the semi-structured column into structure
columns for the BI tool to retrieve
D) All of the above
C) Create a Snowflake view that parse the semi-structured column into structure
columns for the BI tool to retrieve
True or False: The number of COPY operations that run in parallel can exceed the
number of data files to be loaded?
False - it cannot exceed - there are no files to copy
Which of the following is not a best practice for loading data?

A) Splitting large files into a greater number of smaller files distributes the load
among the servers in an active warehouse, thereby increasing performance.
B) Split files by line to avoid records that span chunks
C) Setting all of the load files to the optimal size of 10 to 100 MB in compressed size
D) The number of data files that are processed in parallel is determined by the
number and capacity of servers in a warehouse
E) All of the above
E) All of the above
True or False: users control the file split and size of data being load and the how the
data is divided into micro-partitions?
False - Snowflake determines how the data is divided into micro-partitions. Users
cannot create or configure these partitions.
True or False: Micro-partitions are immutable?
True - subsequent changes of any type to the data will be written to additional micro-
partitions
A Snowflake mechanism that is used to limit the number of micro-partitions scanned
by a query is called ________?

A) Governor
B) Regulator
C) Pruning
D) None of the above
C) Pruning
Database designer should consider using Clustering Keys when (check all that
apply):

A) The tables is very large (multi TB)


B) Table is large enough to reside on many micro-partitions
C) Columns in the table can provide sufficient filtering to select a subset of these
micro-partitions
D) All of the above
D) All of the above

You might also like