CDP Public Cloud Preview Features

The information in these pages is released as part of a preview for the features described. Access to preview features is provided upon request to customers for trial and evaluation. The components are provided ‘as is’ without warranty or support. Further, Cloudera assumes no liability for the use of preview components, which should be used by customers at their own risk. Please contact your Cloudera account team to have a preview feature enabled in your CDP account.

Data Hub

Fine-grained Access Control from ABFS File Browser in Hue
published: 2021-10-21; modified: 2021-10-21
Learn how to enable fine-grained access to ADLS Gen2 containers from the ABFS File Browser and Importer in Hue.
Fine-grained Access Control from S3 File Browser in Hue
published: 2021-10-21; modified: 2021-10-21
Learn how to enable fine-grained access to S3 buckets from the S3 File Browser and Importer in Hue.
Upgrading Data Hubs
published: 2021-10-29; modified: 2021-12-17
You can upgrade a Data Hub cluster in one of three ways: Runtime and Cloudera Manager major/minor version upgrades, maintenance/“hotfix” upgrades, and OS upgrades.

Data Warehouse

Add Access to External S3 Buckets for CDW Clusters on AWS
published: 2021-05-12; modified: 2021-05-25
Learn how to use the CDW UI to add access to external S3 buckets for CDW environments that run on AWS.
Azure Spot instances for Virtual Warehouses
published: 2021-09-28; modified: 2021-09-28
Cloudera Data Warehouse (CDW) now supports using Azure spot instances for Virtual Warehouses to reduce costs if you do not need fault tolerance.
Configure Impala Virtual Warehouses on AWS Environments to Spill to S3
published: 2021-06-14; modified: 2021-06-14
Impala Virtual Warehouses on AWS environments can now be configured to write temporary data (spill) to S3 by specifying the S3 URI when you are creating the Virtual Warehouse.
Enable SSO for JDBC/ODBC Connections to Virtual Warehouses
published: 2021-05-21; modified: 2021-05-25
Enable single sign-on (SSO) for third-party BI tool connections to Virtual Warehouses that use JDBC and ODBC.
Enabling Multi-tenancy in Cloudera Data Warehouse
published: 2021-06-02; modified: 2021-06-02
Achieve tenant isolation by creating a multi-tenant environment in CDW.
Hue: The Next Generation SQL Assistant for Hive in CDW
published: 2021-06-02; modified: 2021-06-10
The one-stop SQL assistant for Hive/LLAP workloads in CDW with combined capabilities of Data Analytics Studio (DAS) and Hue.
Managed storage access for AWS
published: 2021-10-21; modified: 2021-10-21
Understand how Cloudera Data Warehouse (CDW) stores data for multiple tenants and how to set up a managed storage warehouse for AWS.
Managed storage access for Azure
published: 2021-10-21; modified: 2021-10-21
Understand how Cloudera Data Warehouse (CDW) stores data for multiple tenants and how to set up a managed storage warehouse for Azure.
Specifying Custom Environment Names in Cloudera Data Warehouse
published: 2021-04-14; modified: 2021-04-14
Learn how to set custom environment names for your AWS or Azure cloud resources in CDW.
Visualizing Data in Cloudera Data Warehouse Public Cloud
published: 2021-08-27; modified: 2021-08-27
CDW integrates Data Visualization for building graphic representations of data, dashboards, and visual applications.

Governance

Integrating CDP Data Catalog with AWS Glue Data Catalog
published: 2021-08-09; modified: 2021-12-08
While using AWS Glue in Data Catalog, you will be able to experience a complete snapshot metadata view, along with other visible attributes that can power your data governance capabilities.
Navigating to tables and databases in Hue using Data Catalog
published: 2021-08-07; modified: 2021-08-07
The integration between Data Catalog and Cloudera Data Warehouse (CDW) service provides a direct web link to the Hue instance from the Data Catalog web UI, making it easy to navigate across services.
Support for CDP Private Cloud Base clusters in Data Catalog
published: 2021-08-06; modified: 2021-08-06
Data Catalog now supports discovering and profiling assets that reside in CDP Private Cloud Base clusters.
Supporting High Availability for Profiler services
published: 2021-08-07; modified: 2021-08-07
The Data Catalog profiler services is now supported by enabling the High Availability (HA) feature.

Machine Learning

CMK Encryption on AWS
published: 2021-08-10; modified: 2021-11-30
Cloudera Machine Learning on AWS is now able to use a Customer Master Key (CMK) to encrypt data.
Experiments with MLflow
published: 2021-10-27; modified: 2021-10-27
Cloudera Machine Learning now supports the MLflow tracking API and makes use of the MLflow client library as the default method to log experiments.
ML Discovery & Exploration
published: 2021-08-31; modified: 2021-09-16
Cloudera Machine Learning Discovery and Exploration accelerates the ML development workflow with preconfigured data connections and readily available code snippets.
Private Cluster Support
published: 2022-01-06; modified: 2022-01-06
Private Clusters provide a simple way to create a secure cluster, where the API server and the workloads themselves only rely on private IP addresses that are not accessible from the internet.

Management Console

FreeIPA Upgrade
published: 2022-01-06; modified: 2022-01-06
In order to make sure that your FreeIPA nodes are running with the latest patches, you should periodically upgrade your FreeIPA cluster.
Data Lake Upgrade
published: 2021-12-03; modified: 2021-12-17
When new versions of Cloudera Runtime/Cloudera Manager are available for the Data Lake service, you can initiate a Data Lake upgrade. You may also have the option to upgrade to a new OS image.
Deploying CDP in Multiple AWS Availability Zones
published: 2021-12-07; modified: 2021-12-07
By default, CDP provisions Data Lake, FreeIPA and Data Hubs in a single Availability Zone (AZ), but you can optionally choose to deploy them across multiple Availability Zones (multi-AZ).
Public Endpoint Access Gateway for Azure
published: 2021-07-23; modified: 2021-07-27
You can enable Public Endpoint Access Gateway for Azure during Azure environment registration after enabling Cluster Connectivity Manager (CCM). Once activated, the gateway will be used for the Data Lake and all the Data Hubs within the environment.
Public Endpoint Access Gateway for GCP
published: 2021-12-17; modified: 2021-12-17
You can enable Public Endpoint Access Gateway for GCP during GCP environment registration after enabling Cluster Connectivity Manager (CCM). Once activated, the gateway will be used for the Data Lake and all the Data Hubs within the environment.
Using Customer Managed Encryption Keys for Encrypting GCP Disks and Databases
published: 2021-11-17; modified: 2021-11-17
Use Customer Managed Encryption Keys (CMEK) for encrypting attached disks and databases used by CDP environments (Data Lake and FreeIPA) and Data Hubs running on GCP.
Using Customer Managed Keys for Encrypting Azure Managed Disks and Databases
published: 2021-05-13; modified: 2021-12-02
Use Customer Managed Keys (CMK) for encrypting attached disks and databases used by CDP environments (Data Lake and FreeIPA) and Data Hubs running on Azure.
Using Customer Managed Keys for Encrypting EBS Volumes and RDS on AWS
published: 2021-12-03; modified: 2021-12-17
Use Customer Managed Keys (CMK) for encrypting Amazon Elastic Block Store (EBS) volumes used by Data Lake, FreeIPA, and Data Hubs.
Workload Password Policies
published: 2021-04-27; modified: 2021-10-28
In order to bring your workload password complexity requirements in line with company policy, you can manage your FreeIPA password policies via CDP CLI.