Capability
12 artifacts provide this capability.
via “dataframe-aware transformations with column-level lineage”
Python DAG micro-framework for data transformations.
Unique: Implements column-level lineage tracking for dataframe transformations by analyzing function operations and building a fine-grained dependency graph, providing visibility into which raw columns contribute to each feature without requiring explicit lineage annotations
vs others: More detailed than Airflow's task-level lineage because it tracks column-level dependencies, and more practical than manual lineage documentation because it's automatically inferred from transformation code
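The inference idea in this entry can be sketched in a few lines of Python. This is a minimal illustration under stated assumptions, not the listed framework's actual API: it assumes each transformation is a plain function whose parameter names identify the columns it reads, and the names `FEATURES`, `build_graph`, and `raw_sources` are hypothetical.

```python
import inspect

# Hypothetical feature functions: each parameter name is treated as an input column,
# and the function name is treated as the output column.
def spend_per_signup(spend, signups):
    return spend / signups

def spend_zscore(spend_per_signup, spend_mean, spend_std):
    return (spend_per_signup - spend_mean) / spend_std

FEATURES = [spend_per_signup, spend_zscore]

def build_graph(functions):
    """Map each output column to the columns named in its function signature."""
    return {f.__name__: list(inspect.signature(f).parameters) for f in functions}

def raw_sources(column, graph):
    """Return the raw (non-derived) columns that ultimately feed `column`."""
    if column not in graph:
        # Not produced by any function, so it is a raw input column.
        return {column}
    sources = set()
    for parent in graph[column]:
        sources |= raw_sources(parent, graph)
    return sources

graph = build_graph(FEATURES)
print(sorted(raw_sources("spend_zscore", graph)))
```

No lineage annotations appear anywhere in the sketch: the dependency graph falls out of the function signatures alone, which is the property the entry describes.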
via “column-level lineage tracking and visualization”
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Unique: Column-level lineage extraction from SQL, dbt, and Spark with automatic DAG construction and interactive visualization, rather than table-level lineage only; integrates lineage extraction into the ingestion pipeline itself
vs others: Deeper than Collibra's table-level lineage because it tracks individual column transformations; more automated than manual lineage tools because it parses transformation logic directly
Hi HN, I'm Hugo. I've been building Rocky over the past month, shipping fast in the open. The binary is on GitHub Releases, `dagster-rocky` on PyPI, and the VS Code extension on the Marketplace. I held off on a broader announcement until the trust-system surface was coherent enough to talk
Unique: The lineage tracking is integrated at the query parsing level, providing real-time insights into data transformations without additional tooling.
vs others: More comprehensive than traditional lineage tools, which often require separate integrations or manual tracking.
via “column-level data lineage tracking and visualization”
OpenMetadata is a unified metadata platform for data discovery, data observability, and data governance powered by a central metadata repository, in-depth column level lineage, and seamless team collaboration.
Unique: Implements column-level (not table-level) lineage tracking with explicit edge storage in the metadata repository, enabling precise impact analysis and data quality root-cause tracing — most competitors only track table-level lineage
vs others: Provides finer-grained lineage than Collibra or Alation (which typically stop at table level), enabling data engineers to identify exactly which source columns caused downstream data quality issues
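Explicit edge storage is what makes impact analysis and root-cause tracing simple graph traversals. The sketch below is illustrative only and does not use OpenMetadata's API or schema: the edge list, the column names, and the helpers `index_edges` and `reachable` are all hypothetical.

```python
from collections import defaultdict, deque

# Hypothetical column-level edges stored as (source column, target column) pairs.
EDGES = [
    ("raw.orders.amount", "staging.orders.amount_usd"),
    ("raw.orders.currency", "staging.orders.amount_usd"),
    ("staging.orders.amount_usd", "marts.revenue.total_usd"),
    ("raw.customers.id", "marts.revenue.customer_id"),
]

def index_edges(edges):
    """Build downstream and upstream adjacency maps from the stored edges."""
    downstream = defaultdict(set)
    upstream = defaultdict(set)
    for source, target in edges:
        downstream[source].add(target)
        upstream[target].add(source)
    return downstream, upstream

def reachable(start, adjacency):
    """Breadth-first search: every column reachable from `start`, excluding `start`."""
    seen = set()
    queue = deque([start])
    while queue:
        node = queue.popleft()
        for nxt in adjacency.get(node, ()):
            if nxt not in seen:
                seen.add(nxt)
                queue.append(nxt)
    return seen

downstream, upstream = index_edges(EDGES)

# Impact analysis: which columns are affected if raw.orders.currency changes?
impacted = reachable("raw.orders.currency", downstream)

# Root-cause tracing: which columns could explain a bad marts.revenue.total_usd?
root_causes = reachable("marts.revenue.total_usd", upstream)
```

With table-level edges only, the same query on `raw.orders` would flag every column in `marts.revenue`, including `customer_id`, which does not depend on the orders table at all.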
via “column-level lineage and data type tracking”
MCP server for dbt-core (OSS) users, as the official dbt MCP only supports dbt Cloud. Supports project metadata, model- and column-level lineage, and dbt documentation.
Unique: Extracts column-level lineage from dbt manifest contracts and test metadata, enabling fine-grained tracking of data transformations. Combines column definitions, test associations, and data type information into a unified lineage graph without requiring SQL parsing.
vs others: Provides column-level detail that simple model lineage cannot offer, and requires no external data catalog or SQL parsing — all information comes from dbt artifacts.
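One way lineage can be derived from dbt artifacts alone is sketched below. The listing does not say how this server links columns, so the matching rule here (a child column is linked to a same-named column in a direct parent model) is purely an assumption for illustration, and `MANIFEST` is a toy fragment that only mirrors the `nodes`, `columns`, and `depends_on` layout of a dbt `manifest.json`. The helper `candidate_column_edges` is hypothetical.

```python
# Toy fragment shaped like the "nodes" section of a dbt manifest.json.
MANIFEST = {
    "nodes": {
        "model.shop.stg_orders": {
            "columns": {
                "order_id": {"data_type": "integer"},
                "amount": {"data_type": "numeric"},
            },
            "depends_on": {"nodes": []},
        },
        "model.shop.orders": {
            "columns": {
                "order_id": {"data_type": "integer"},
                "amount": {"data_type": "numeric"},
                "is_large": {"data_type": "boolean"},
            },
            "depends_on": {"nodes": ["model.shop.stg_orders"]},
        },
    }
}

def candidate_column_edges(manifest):
    """Link each documented column to a same-named column in a direct parent model.

    Name matching is an assumed heuristic, not a documented algorithm; it misses
    renamed or computed columns (such as is_large here).
    """
    nodes = manifest["nodes"]
    edges = []
    for node_id, node in nodes.items():
        for parent_id in node["depends_on"]["nodes"]:
            parent = nodes.get(parent_id)
            if parent is None:
                continue  # parent may be a source or seed outside "nodes"
            for name, meta in node["columns"].items():
                if name in parent["columns"]:
                    edges.append(
                        (f"{parent_id}.{name}", f"{node_id}.{name}", meta.get("data_type"))
                    )
    return edges

edges = candidate_column_edges(MANIFEST)
```

Each edge carries the child column's declared data type, so the resulting graph combines lineage with type information without reading any SQL.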
via “data lineage and dependency tracking”
Transcend MCP Server — Data Discovery tools.
Unique: Exposes data lineage as queryable MCP tools rather than static visualizations, enabling LLMs to perform programmatic lineage analysis, impact assessment, and compliance checks without human interpretation of lineage diagrams
vs others: Unlike traditional data lineage tools that produce static reports, this makes lineage queryable and actionable through the MCP protocol, enabling automated reasoning about data dependencies
via “data lineage tracking”
Data Processing & ETL infrastructure for Generative AI applications
Unique: Utilizes a comprehensive metadata management system that captures detailed lineage information, making it easier to comply with regulatory requirements compared to simpler tracking methods.
vs others: More detailed than basic lineage tracking in tools like Apache Atlas, as it captures every transformation step and its impact on data quality.
via “dataset-versioning-and-lineage”
via “lineage tracking and impact analysis”
via “audit-trail-and-model-lineage-tracking”
via “dataset-versioning-and-lineage-tracking”
via “dataset versioning and lineage tracking”
© 2026 Unfragile. The platform for software for agents.