👨‍🔧
The Ops Compendium
  • The Ops Compendium
  • Definitions
    • Ops Definition Comparisons
  • ML & DL Compendium
  • MLOps
    • MLOps Intro
    • MLOps Teams
    • MLOps Literature
    • MLOps Course
    • MLOps Patterns
    • ML Experiment Management
    • ML Model Monitoring & Alerts
    • MLOps Tools
    • MLOps Deployment
    • Feature Stores & Feature Pipelines
    • Model Formats
    • AI As Data
    • MLOps Interview Questions
    • ML Architecture
  • DataOps
    • SQL
    • Tools
    • Databases
    • Database Modeling
    • Data Analytics
    • Data Engineering
    • Data Pipelines
    • Data Strategy
    • Data Vision
    • Data Teams
    • Data Catalogs
    • Data Governance
    • Data Quality
    • Data Observability
    • Data Program Management
    • Data KPIs
    • Data Mesh
    • Data Contract
    • Data Product
    • Data Engineering Questions & Training
    • Data Patterns
    • Data Architecture
    • Data Platforms
    • Data Lineage
  • DevOps
    • DevOps Strategy
    • DevOps Tools
      • Tutorials
      • Continuous Integration
      • Docker
      • Kubernetes
      • Cloud Objects
      • Key Value DB
      • API Gateway
      • Infrastructure As code
      • Logs
      • ELK
      • SLO
    • DevOps Courses
  • DevSecOps
    • Definitions
    • Tools
    • Concepts
  • Architecture
    • Problems
    • Development Concepts
    • System Design
Powered by GitBook
On this page

Was this helpful?

Edit on GitHub
  1. DataOps

Tools

PreviousSQLNextDatabases

Last updated 2 years ago

Was this helpful?

  1. - an open source distributed platform for change data capture

  2. - "Hudi is a rich platform to build streaming data lakes with incremental data pipelines on a self-managing database layer, while being optimized for lake engines and regular batch processing."

  3. - "Continuous SQL Pipelines for Cloud Data Lakes. No custom coding. No orchestration. No infrastructure maintenance."

  4. - "dbt helps data teams work like software engineers—to ship trusted data, faster. collaboratively deploy analytics code following software engineering best practices like modularity, portability, CI/CD, and documentation. Now anyone who knows SQL can build production-grade data pipelines."

  5. - A simplified, lightweight ETL Framework based on Apache Spark

  6. BI tools that directly connect to a DB.

    1. - Connect and query your data sources, build dashboards to visualize data and share them with your company.

    2. - "" - by fullstackgrowth.com

    3. - Apache Superset is a modern data exploration and visualization platform

  7. - Stitch rapidly moves data from 130+ sources into a data warehouse so you can get to answers faster, no coding required.

  8. - Generate across all platforms and channels in a common format, with the Snowplow Behavioral Data Platform.

  9. - A SINGLE PLATFORM FOR INTEGRATION & WORKFLOW AUTOMATION ACROSS YOUR ORGANIZATION

  10. - Test data quality at scale

Debezium
Hudi
Upsolver
DBT
intro
in depth intro
dbt in one hour
CI/CD with dbt
snowflake terraform and dbt
hubspot snowflake and dbt
Metorikku
redash
Metabase
is an easy-to-use, open source business intelligence tool that lets you analyze data from a variety of data destinations and sources. It also follows a simple and fast setup process. Its data visualization capabilities are exceptional and can be showcased in a user-friendly way, without using SQL. With Metabase, you can easily share live dashboards, automated reports, and questions with the rest of your team.
Superset
Stitch
SnowPlow
complete, accurate and well-structured event data
Workato
AWS Deequ