Practice Free 250-608 Exam Online Questions
In Greenplum, a ___ is a logical structure that contains tables, views, indexes, and other database objects.
- A . segment
- B . role
- C . schema
- D . master
What is the primary function of the gprecoverseg utility?
- A . To reassign user privileges
- B . To balance disk usage across segments
- C . To recover failed segments from mirrors
- D . To upgrade all segment PostgreSQL instances
In PostGIS, what is the typical data type used to store spatial shapes like points, lines, and polygons?
- A . VARCHAR
- B . BYTEA
- C . GEOMETRY
- D . XML
A data analyst wants to train a logistic regression model in Greenplum to predict customer churn. They’ve structured a dataset with is_churned as the target and several numeric features.
Which steps should they take to complete model training using MADlib?
- A . Export the data to a CSV file and train using scikit-learn
- B . Use a SELECT statement to call madlib.logregr_train() on the dataset
- C . Use gpload to load the model weights
- D . Create a writable external table to store model predictions
When using gpload, which two operations can you configure in the YAML control file? (Choose two)
- A . Index creation
- B . Error log directory
- C . Field delimiter specification
- D . Table vacuuming
What is the primary benefit of using external readable tables in Greenplum?
- A . They support transaction rollback
- B . They reduce disk usage by avoiding data duplication
- C . They replicate data to all segment hosts
- D . They compress data before load
Which statement is true about using gpload for loading data into Greenplum?
- A . It requires the source files to be in a database-compatible binary format
- B . It must be run from the master host
- C . It uses a YAML configuration to define load parameters
- D . It loads data serially for each segment
Which Greenplum utility is used to redistribute table data across segments based on a new distribution policy?
- A . gpexpand
- B . gptransfer
- C . gpdist
- D . gpmapreduce
Which of the following is the primary purpose of the Greenplum Command Center (GPCC)?
- A . Creating distributed table schemas
- B . Managing physical segment storage
- C . Monitoring and administering Greenplum clusters through a web UI
- D . Running data science workflows and Jupyter notebooks
What are two advantages of using GPCC over command-line monitoring tools? (Choose two)
- A . Centralized visualization of cluster activity
- B . Higher privileges for OS-level configuration
- C . Real-time query tracking and cancellation
- D . Ability to write raw SQL directly to disk
