DataDios Data Source
Overview
Data Sources in DataDios are configured connections to external databases, APIs, files, and other data systems. They serve as the foundation for all data operations within the platform, enabling you to connect to, explore, and manage various types of data repositories.
Accessing Data Sources
- Navigate to the Data Source section from the left navigation menu
- Click on Sources to view all available data sources
- The main interface displays a table with columns for Name, Ownership, Type, and Actions
Creating a New Data Source
Step 1: Access Creation Dialog
- Click the + CREATE SOURCE button in the top-right corner of the Data Sources page
- The "Create Data Sources" dialog will open
Step 2: Select Data Source Type
- In the "Select DS type" dropdown, choose from available options:
- API HUB: For API-based data connections
- Data Quality: For data quality monitoring sources
- Database: For traditional database connections
- Folder: For file-based data sources
- Gen AI Models: For AI model integrations
- Payment Gateway: For payment system connections
- Secret Store: For secure credential storage
- Service: For web service connections
Step 3: Configure Data Source
- Name: Enter a descriptive name for your data source
- Connection Parameters: Fill in the required configuration details based on the selected type
- Choose between Form or Json input methods using the toggle buttons
Step 4: Save and Test
- Click Save to create the data source
- Use Test Connection to verify the configuration is working correctly
Importing Data Sources
Using the Import Feature
- Click the IMPORT button in the top navigation
- Select your import file (typically JSON or configuration file)
- Follow the import wizard to complete the process
- Verify imported data sources appear in the main list
Managing Existing Data Sources
Viewing Data Source Details
- Click the arrow (▷) next to any data source name to expand and view sub-items
- For hierarchical data sources like "Gen AI Models", you'll see individual items listed with their ownership and type information
Updating Data Source Configuration
Step 1: Access Update Dialog
- Click the edit icon (pencil) in the Actions column for the desired data source
- The "Update Data Source" dialog will open showing current configuration
Step 2: Modify Settings
Available configuration options include:
- project_name: Update the associated project
- model_type: Change the model type (e.g., gpt-4o)
- embedding_model: Modify embedding settings
- schedule_sync: Adjust synchronization schedule
- Select: Update selection parameters
- API_KEY: Update authentication credentials
- secret_name: Modify secret store references
Step 3: Apply Changes
Use the available action buttons:
- COPY: Duplicate the current configuration
- DELETE: Remove the data source (use with caution)
- SAVE: Apply and save changes
- TEST CONNECTION: Verify updated configuration works
Data Explorer
Accessing Data Explorer
- Click on any data source name to open the Data Explorer
- The explorer provides multiple views of your data source information
Available Views
Object Meta Data Tab
- Attributes: View data structure and column information
- Constraints: See data constraints and relationships
- Indexes: Review database indexes and performance optimizations
- Partitions: Understand data partitioning strategies
The main table displays:
- ATTR_SNO: Attribute sequence number
- ATTR_NAME: Column or attribute names
- DATA_TYPE: Data types (VARCHAR, INTEGER, BOOLEAN, etc.)
- DATA_LENGTH: Maximum length constraints
- DATA_SCALE: Precision settings
- NULLABLE: Whether null values are allowed
- GENERIC_TYPE: Standardized data type classification
Object Data Tab
- View actual data content from the data source
- Browse records and understand data patterns
- Useful for data validation and exploration
Graph Tab
- Visual representation of data relationships
- Understand data lineage and dependencies
- Explore connections between different data elements
Data Quality Rules Tab
- Review configured data quality rules
- Monitor data quality standards
- Set up validation criteria
Data Quality Score Tab
- View overall data quality metrics
- Track data quality trends over time
- Identify areas for improvement
Best Practices
Data Source Configuration
- Use Descriptive Names: Choose clear, meaningful names for easy identification
- Test Connections: Always test connections after configuration changes
- Secure Credentials: Use the Secret Store for sensitive information like API keys
- Regular Updates: Keep connection parameters current as systems change