Practice Free DA0-002 Exam Online Questions
The following SQL code returns an error in the program console:
SELECT firstName, lastName, SUM(income)
FROM companyRoster
SORT BY lastName, income
Which of the following changes allows this SQL code to run?
- A . SELECT firstName, lastName, SUM(income) FROM companyRoster HAVING SUM(income) > 10000000
- B . SELECT firstName, lastName, SUM(income) FROM companyRoster GROUP BY firstName, lastName
- C . SELECT firstName, lastName, SUM(income) FROM companyRoster ORDER BY firstName, income
- D . SELECT firstName, lastName, SUM(income) FROM companyRoster
A data analyst troubleshoots a dashboard every day for a week.
Which of the following techniques best addresses how to validate the data moving forward?
- A . Inquiring about structure changes
- B . Setting up monitoring alerts
- C . Reaching out to users daily
- D . Rebuilding the dashboard
A data analyst wants to find the mean score that students received in science classes.
Given the following tables:

Which of the following will produce the desired results?
- A . COUNT and GROUP
- B . SUM and GROUP
- C . average and where
- D . CONCAT and WHERE
A data analyst wants to analyze sales data for possible customer patterns.
Which of the following should the analyst use to complete this task?
- A . Standardizing
- B . Clustering
- C . Merging
- D . Scaling
A data analyst receives a request for the current employee head count and runs the following SQL statement:
SELECT COUNT(EMPLOYEE_ID) FROM JOBS
The returned head count is higher than expected because employees can have multiple jobs.
Which of the following should return an accurate employee head count?
- A . SELECT JOB_TYPE, COUNT DISTINCT(EMPLOYEE_ID) FROM JOBS
- B . SELECT DISTINCT COUNT(EMPLOYEE_ID) FROM JOBS
- C . SELECT JOB_TYPE, COUNT(DISTINCT EMPLOYEE_ID) FROM JOBS
- D . SELECT COUNT(DISTINCT EMPLOYEE_ID) FROM JOBS
While interacting with a data set, a data analyst learns that copies of the data are distributed across several data repositories around the globe.
Which of the following concepts best describes this practice?
- A . Redundancy
- B . High availability
- C . Replication
- D . Duplication
A data company needs a visualization that shows the availability zones from the last ten years and any future availability zones that the company will be using in the next five years.
Which of the following is the most appropriate visualization to display this information?
- A . Bar chart
- B . Mosaic plot
- C . Map
- D . Pie chart
A data analyst needs to join together a table data source and web API data source using Python.
Which of the following is the best way to accomplish this task?
- A . Convert the data from the API and database to a varchar format and convert them to pandas DataFrames that are then merged together.
- B . Convert the data from the API and database to a JSON format and convert them to pandas DataFrames that are then merged together.
- C . Convert the data from the API and database to a TXT format and convert them to pandas DataFrames that are then merged together.
- D . Convert the data from the API and database to a string format and convert them to pandas DataFrames that are then merged together.
An analyst needs to produce a final dataset using the following tables:


The expected output should be formatted as follows:
| CourseID | SectionNumber | StudentID | FirstName | LastName |
Which of the following actions is the best way to produce the requested output?
- A . Aggregate
- B . Join
- C . Group
- D . Filter
An analyst is building a reporting deck. The deck must include tracking and visualizing metrics and row-level security.
Which of the following actions should the analyst take after meeting all of the requirements?
- A . Show a mock-up to the team.
- B . Explain the desired level of reporting detail.
- C . Present an analysis of the data.
- D . Find out the project due date.
