DataHub Releases
Summary
Version | Release Date | Links |
---|---|---|
v0.15.0rc3 | 2024-12-11 | Release Notes, View on GitHub |
v0.14.1 | 2024-09-17 | Release Notes, View on GitHub |
v0.14.0.2 | 2024-08-21 | View on GitHub |
v0.14.0 | 2024-08-13 | View on GitHub |
v0.13.3 | 2024-05-23 | View on GitHub |
v0.13.2 | 2024-04-16 | View on GitHub |
v0.13.1 | 2024-04-02 | View on GitHub |
v0.13.0 | 2024-02-29 | View on GitHub |
v0.12.1 | 2023-12-08 | View on GitHub |
v0.12.0 | 2023-10-25 | View on GitHub |
v0.11.0 | 2023-09-08 | View on GitHub |
v0.10.5 | 2023-08-02 | View on GitHub |
v0.10.4 | 2023-06-09 | View on GitHub |
v0.10.3 | 2023-05-25 | View on GitHub |
v0.10.2 | 2023-04-13 | View on GitHub |
v0.10.1 | 2023-03-23 | View on GitHub |
v0.10.0 | 2023-02-07 | View on GitHub |
v0.9.6.1 | 2023-01-31 | View on GitHub |
v0.9.6 | 2023-01-13 | View on GitHub |
v0.9.5 | 2022-12-23 | View on GitHub |
v0.9.4 | 2022-12-20 | View on GitHub |
v0.9.3 | 2022-11-30 | View on GitHub |
v0.9.2 | 2022-11-04 | View on GitHub |
v0.9.1 | 2022-10-31 | View on GitHub |
v0.9.0 | 2022-10-11 | View on GitHub |
v0.8.45 | 2022-09-23 | View on GitHub |
v0.8.44 | 2022-09-01 | View on GitHub |
v0.8.43 | 2022-08-09 | View on GitHub |
v0.15.0rc3
Released on 2024-12-11 by @RyanHolstien.
What's Changed
- fix(ingest): ensure sentry is initialized with graph tags by @hsheth2 in https://github.com/datahub-project/datahub/pull/11949
- fix(ingest): more error handling by @anshbansal in https://github.com/datahub-project/datahub/pull/11969
- feat(datahub-gc): add truncation days param by @david-leifker in https://github.com/datahub-project/datahub/pull/11967
- docs(release): Update v_0_3_7.md by @david-leifker in https://github.com/datahub-project/datahub/pull/11937
- fix(ci): fix build-and-test by @david-leifker in https://github.com/datahub-project/datahub/pull/11974
- refactor(ingest/powerbi): organize code within the module based on responsibilities by @sid-acryl in https://github.com/datahub-project/datahub/pull/11924
- fix(schematron): fix for jdk8 by @david-leifker in https://github.com/datahub-project/datahub/pull/11975
- fix(automations docs): Update snowflake-tag-propagation.md to include permissions required for the Automation by @jjoyce0510 in https://github.com/datahub-project/datahub/pull/11977
- chore(bump): bump version of akka for datahub-frontend by @david-leifker in https://github.com/datahub-project/datahub/pull/11979
- feat(ingestion): extend feast plugin to ingest tags and owners by @margaridafernandes-trip in https://github.com/datahub-project/datahub/pull/11784
- fix(validation): additional URN validation adjustments by @david-leifker in https://github.com/datahub-project/datahub/pull/11973
- feat(search): Update search_config.yaml by @david-leifker in https://github.com/datahub-project/datahub/pull/11971
- docs(release): update recommended CLI by @anshbansal in https://github.com/datahub-project/datahub/pull/11986
- fix(ingest/kafka):add poll for admin client for oauth_cb by @mayurinehate in https://github.com/datahub-project/datahub/pull/11985
- fix(ingestion/iceberg): Improvements to iceberg source by @skrydal in https://github.com/datahub-project/datahub/pull/11987
- feat(ingest): standardize sql type mappings by @hsheth2 in https://github.com/datahub-project/datahub/pull/11982
- feat(ingest): bump typing_extensions dep by @hsheth2 in https://github.com/datahub-project/datahub/pull/11965
- feat(ingest): add tests for colon characters in urns by @hsheth2 in https://github.com/datahub-project/datahub/pull/11976
- feat(ingest/athena): handle partition fetching errors by @hsheth2 in https://github.com/datahub-project/datahub/pull/11966
- fix: Add option for disabling ownership extraction by @sagar-salvi-apptware in https://github.com/datahub-project/datahub/pull/11970
- feat(ingest/dremio): Retrieve default_schema for SQL views by @acrylJonny in https://github.com/datahub-project/datahub/pull/11832
- fix(docs): fix sample business glossary by @acrylJonny in https://github.com/datahub-project/datahub/pull/11669
- fix(java-sdk): custom properties patch client by @shirshanka in https://github.com/datahub-project/datahub/pull/11984
- fix[ingest/build]: Disable preflight script as it is not needed anymore by @treff7es in https://github.com/datahub-project/datahub/pull/11989
- feat: connector for Neo4j by @k-bartlett in https://github.com/datahub-project/datahub/pull/11526
- fix(ingestion/dremio): Fixed lineage view for dremio EE by @sagar-salvi-apptware in https://github.com/datahub-project/datahub/pull/11990
- fix(ingest/gc): delete invalid dpis by @anshbansal in https://github.com/datahub-project/datahub/pull/11998
- feat(airflow): show dag/task logs in CI by @hsheth2 in https://github.com/datahub-project/datahub/pull/11981
- chore(ingest): remove deprecated calls to Urn.create_from_string by @hsheth2 in https://github.com/datahub-project/datahub/pull/11983
- fix(ingest): resolve missing numeric types for profiling by @mayurinehate in https://github.com/datahub-project/datahub/pull/11991
- fix(docs): Add spark.datahub.stage_metadata_coalescing to recommended configuration for databricks by @acrylJonny in https://github.com/datahub-project/datahub/pull/11800
- build(coverage): enable code coverage for java and python by @chakru-r in https://github.com/datahub-project/datahub/pull/11992
- chore(docs): Update v_0_3_7.md - v0.3.7.5 by @david-leifker in https://github.com/datahub-project/datahub/pull/12005
- feat(java-sdk): add utils classes to give equivalence with python uti… by @shirshanka in https://github.com/datahub-project/datahub/pull/12002
- fix(ingest/sagemaker): Gracefully handle missing model group by @treff7es in https://github.com/datahub-project/datahub/pull/12000
- fix(ingest/gc): typo fix, do not delete empty entities by @anshbansal in https://github.com/datahub-project/datahub/pull/12011
- fix(ingest/gc): do not cleanup empty job/flow by @anshbansal in https://github.com/datahub-project/datahub/pull/12013
- fix(test): fix metadata-io tests by @david-leifker in https://github.com/datahub-project/datahub/pull/12006
- fix(ingest/looker): Don't fail on unknown liquid filters by @treff7es in https://github.com/datahub-project/datahub/pull/12014
- feat(docs-website) fix links by @jayacryl in https://github.com/datahub-project/datahub/pull/12019
- fix(ci): fix datahub-client validatePythonEnv by @david-leifker in https://github.com/datahub-project/datahub/pull/12023
- test(urn-validation): additional test case by @david-leifker in https://github.com/datahub-project/datahub/pull/12001
- feat(hudi): add hudi platform to the list of default platforms by @shirshanka in https://github.com/datahub-project/datahub/pull/11993
- fix(airflow): fix AthenaOperator extraction by @steffengr in https://github.com/datahub-project/datahub/pull/11857
- feat(tableau): review reporting and debug traces by @sgomezvillamor in https://github.com/datahub-project/datahub/pull/12015
- fix(ingest/tableau): make
sites.get_by_id
call optional by @hsheth2 in https://github.com/datahub-project/datahub/pull/12024 - feat(cli): add platform filter for undo soft delete by @anshbansal in https://github.com/datahub-project/datahub/pull/12012
- feat(mcp): add kafka batch processing mode option (#4449) by @david-leifker in https://github.com/datahub-project/datahub/pull/12021
- chore: update label for team by @anshbansal in https://github.com/datahub-project/datahub/pull/12032
- fix(ui): Adding overflow handling (also goes to oss) by @jjoyce0510 in https://github.com/datahub-project/datahub/pull/12022
- fix(ingest/pulsar): handle missing/invalid schema objects by @Alice-608 in https://github.com/datahub-project/datahub/pull/11945
- fix(filters) Fix issues with structured properties filters by @chriscollins3456 in https://github.com/datahub-project/datahub/pull/11946
- fix(ingest): avoid bad IPython version by @hsheth2 in https://github.com/datahub-project/datahub/pull/12035
- feat(ingest/kafka): additional validation for oauth_db signature by @mayurinehate in https://github.com/datahub-project/datahub/pull/11996
- fix(ingest/gc): Adding test and more checks to gc source by @treff7es in https://github.com/datahub-project/datahub/pull/12027
- fix(graph-edge): fix graph edge delete exception by @david-leifker in https://github.com/datahub-project/datahub/pull/12025
- feat(ingest): add urn validation test files by @hsheth2 in https://github.com/datahub-project/datahub/pull/12036
- chore(deps): bump cross-spawn from 7.0.3 to 7.0.6 in /datahub-web-react by @dependabot in https://github.com/datahub-project/datahub/pull/11978
- fix(datahub-client): prevent unneeded classes in datahub-client jar by @david-leifker in https://github.com/datahub-project/datahub/pull/12037
- fix(entity-service): no-op batches by @david-leifker in https://github.com/datahub-project/datahub/pull/12047
- docs(compliance-forms) update guide for creating form via UI by @maggiehays in https://github.com/datahub-project/datahub/pull/11936
- feat(snowflake): adding oauth token bypass to snowflake by @gabe-lyons in https://github.com/datahub-project/datahub/pull/12048
- fix(ingest): avoid shell entities during view lineage generation by @mayurinehate in https://github.com/datahub-project/datahub/pull/12044
- fix(logs): add actor urn on unauthorised by @anshbansal in https://github.com/datahub-project/datahub/pull/12030
- fix(ingest/snowflake): Add handling of Hybrid Table type for Snowflake ingestion by @siong-tcha in https://github.com/datahub-project/datahub/pull/12039
- fix(ingest/powerbi): reduce type cast usage by @hsheth2 in https://github.com/datahub-project/datahub/pull/12004
- refactor(ingest/sql): add _get_view_definition helper method by @hsheth2 in https://github.com/datahub-project/datahub/pull/12033
- feat(ingest/superset): initial support for superset datasets by @hwmarkcheng in https://github.com/datahub-project/datahub/pull/11972
- fix(ingest/sagemaker): Adding option to control retry for any aws source by @treff7es in https://github.com/datahub-project/datahub/pull/8727
- fix(ingest/gc): Additional dataprocess cleanup fixes by @treff7es in https://github.com/datahub-project/datahub/pull/12049
- feat(tableau): adds more reporting metrics to better understand lineage construction in tableau ingestion by @sgomezvillamor in https://github.com/datahub-project/datahub/pull/12008
- feat(ingestion/tableau): hidden asset handling by @haeniya in https://github.com/datahub-project/datahub/pull/11559
- feat(airflow): drop Airflow < 2.3 support + make plugin v2 the default by @hsheth2 in https://github.com/datahub-project/datahub/pull/12056
- fix(web) disallow deselecting all degrees on impact analysis view by @jayacryl in https://github.com/datahub-project/datahub/pull/12063
- feat: Add parent container hierarchy label to the container by @kanavnarula in https://github.com/datahub-project/datahub/pull/11705
- fix(py-sdk): DataJobPatchBuilder handling timestamps, output edges by @shirshanka in https://github.com/datahub-project/datahub/pull/12067
- fix(plugin-logging): adjust error logging in plugin registry by @david-leifker in https://github.com/datahub-project/datahub/pull/12064
- build(metadata-events): fix shell interpreter mismatch in build script by @chakru-r in https://github.com/datahub-project/datahub/pull/12066
- fix(entity-service): handle no-op system-metadata batches by @david-leifker in https://github.com/datahub-project/datahub/pull/12055
- build(coverage): rename python coverage reports by @chakru-r in https://github.com/datahub-project/datahub/pull/12071
- fix(ingest): replace sqllineage/sqlparse with our SQL parser by @sagar-salvi-apptware in https://github.com/datahub-project/datahub/pull/12020
- fix(entity-service): prevent mutation of systemMetdata on prev by @david-leifker in https://github.com/datahub-project/datahub/pull/12081
- build(datahub-frontend): enable code-coverage by @chakru-r in https://github.com/datahub-project/datahub/pull/12072
- build(ci): codecov integration by @chakru-r in https://github.com/datahub-project/datahub/pull/12073
- fix(openapi): adds in previously ignored keep alive value by @RyanHolstien in https://github.com/datahub-project/datahub/pull/12068
- feat(ui) Add alchemy component library to FE by @chriscollins3456 in https://github.com/datahub-project/datahub/pull/12054
- docs(structured properties) add guide by @maggiehays in https://github.com/datahub-project/datahub/pull/12070
- feat(ingest): allow max_workers=1 with ASYNC_BATCH rest sink by @hsheth2 in https://github.com/datahub-project/datahub/pull/12088
- fix(openapi): fix sort criteria parameter by @RyanHolstien in https://github.com/datahub-project/datahub/pull/12090
- feat(ingest/snowflake): allow option for incremental properties by @mayurinehate in https://github.com/datahub-project/datahub/pull/12080
- fix(cli): don't use /api in gms url by @anshbansal in https://github.com/datahub-project/datahub/pull/12083
- docs(ingest/athena): update recipe with aws key pair example by @mayurinehate in https://github.com/datahub-project/datahub/pull/12076
- fix(ingest/gc): minor tweak gc source by @anshbansal in https://github.com/datahub-project/datahub/pull/12093
- fix(ingest/abs): detect jsonl schema by @acrylJonny in https://github.com/datahub-project/datahub/pull/11775
- feat(ingest/kafka): Flag for optional schemas ingestion by @skrydal in https://github.com/datahub-project/datahub/pull/12077
New Contributors
- @margaridafernandes-trip made their first contribution in https://github.com/datahub-project/datahub/pull/11784
- @k-bartlett made their first contribution in https://github.com/datahub-project/datahub/pull/11526
- @chakru-r made their first contribution in https://github.com/datahub-project/datahub/pull/11992
- @Alice-608 made their first contribution in https://github.com/datahub-project/datahub/pull/11945
- @siong-tcha made their first contribution in https://github.com/datahub-project/datahub/pull/12039
- @hwmarkcheng made their first contribution in https://github.com/datahub-project/datahub/pull/11972
Full Changelog: https://github.com/datahub-project/datahub/compare/v0.15.0rc2...v0.15.0rc3
v0.15.0rc2
Released on 2024-11-27 by @david-leifker.
View the release notes for v0.15.0rc2 on GitHub.
v0.14.1
Released on 2024-09-17 by @david-leifker.
DataHub v0.14.1 Release Notes
User Experience
Enhanced Data Propagation UI: New features allow viewing propagated column documentation, source information, and asset-level propagation details. This improves visibility into data lineage and enables better understanding of data flow across the organization. (#11047)
Improved Search Result Tracking: Added page number to search result click events, enabling better measurement of search ranking performance. This helps users understand and optimize their search experience. (#11151)
Fixed Display Issues: Resolved issues with displaying "0" values for last ingested data and improved handling of multilingual characters in descriptions. These fixes ensure more accurate and readable information presentation. (#10840, #10975)
Developer Experience
Performance Improvements:
Enhanced Search Capabilities:
- Added support for custom highlighting fields in GraphQL queries, allowing faster and more customizable data retrieval. (#11339)
- Implemented new search query functionality to filter by parents/children of Domains or Containers. (#11279)
- Added support for multiple values in 'CONTAIN', 'START_WITH', and 'END_WITH' operators, enabling more flexible and precise searches. (#11068)
API Improvements:
Bug Fixes:
- Resolved issues with forward slash handling in search queries, empty key-value pairs in Elasticsearch mapping, and support for various data types in object fields. These fixes improve search accuracy and data representation. (#10932, #11004, #11066)
- Addressed Postgres regression by upgrading the ebean library from version 12.x to 15.x, resolving a read lock NPE issue. (#11379)
Metadata Ingestion
S3 Integration Enhancements:
BigQuery Improvements:
- Implemented query log extractor for BigQuery, creating "Query" entities with usage statistics, lineage, and operation details. (#10994)
- Added support for filtering GCP project ingestion based on project labels, enabling more targeted data collection. (#11169)
- Implemented query job retries for transient errors, improving system robustness. (#11162)
Snowflake Updates:
New and Updated Connectors:
- Added ingestion source for SAP Analytics Cloud, expanding DataHub's integration capabilities. ([#109](https://github.com/datahub-project/datahub/pull/109)58)
- Enhanced Salesforce connector with customizable API version and improved error messages. (#11145, #11266)
- Updated Tableau ingestion process with new parameters and improved field type parsing. (#11255, #11202)
Other Ingestion Improvements:
- Added support for MongoDB database ingestion as containers. (#11178)
- Implemented automatic capturing of Snowflake assets with Pandas I/O Manager in Dagster module. (#11189)
- Enhanced Fivetran ingestion with destination ID filtering capabilities. (#11277)
- Added support for browse-only tables in Databricks ingestion. (#10766)
Other Improvements and Fixes
- Upgraded various dependencies including Kafka, Azure Identity, Acryl-SQLglot, and GraphQL/Spring versions.
- Improved error handling and logging across multiple components.
- Enhanced test coverage and reliability.
- Updated documentation for various features and processes.
Breaking Changes
Notable breaking changes include:
- Removal of
lower
method fromget_db_name
inSQLAlchemySource
, affecting URNs of related entities. - Changes to default sink mode and aspect handling that require server version 0.14.0+.
See the full details here.
Contributors
We extend our heartfelt thanks to all contributors for their valuable work on this release:
First-Time Contributors
@AaronYang0628, @alexandrebunn, @alisa-aylward-toast, @arpanchakra29, @esselius, @eunseokyang, @ignitz, @milindgupta, @milindgupta9, @Nbagga14, @rohansun, @sakethvarma397, @vignesh-hbk
Repeat Contributors
@deepgarg-visa, @dushayntAW, @feldjay, @filipe-caetano-ovo, @ksrinath, @Masterchen09, @matthew-coudert-cko, @mayurinehate, @nmbryant, @pinakipb2, @prashanthic23, @sagar-salvi-apptware, @siladitya2, @sleeperdeep
DataHub Maintainers
@anshbansal, @asikowitz, @chriscollins3456, @darnaut, @david-leifker, @eboneil, @hsheth2, @jjoyce0510, @maggiehays, @pedro93, @RyanHolstien, @shirshanka, @sid-acryl, @skrydal, @treff7es, @yoonhyejin
Your contributions are invaluable in making DataHub better for everyone. Thank you!
What's Changed
- test(smoke-test): updates to smoke-tests by @david-leifker in https://github.com/datahub-project/datahub/pull/11152
- feat(dbt): support prefer_sql_parser_lineage with sources enabled by @hsheth2 in https://github.com/datahub-project/datahub/pull/11168
- feat(actions): updates to gha workflows by @david-leifker in https://github.com/datahub-project/datahub/pull/11150
- build: fix docker warnings by @anshbansal in https://github.com/datahub-project/datahub/pull/11163
- feat(hooks): Make hook enable flag non-default by @pedro93 in https://github.com/datahub-project/datahub/pull/11159
- fix(ci): smoke-test changes do not need to build images by @david-leifker in https://github.com/datahub-project/datahub/pull/11174
- fix(ci): fix single tag comma split by @david-leifker in https://github.com/datahub-project/datahub/pull/11179
- lint(restore-indices): clean-up restore indices class by @david-leifker in https://github.com/datahub-project/datahub/pull/11176
- fix(ci): typo by @david-leifker in https://github.com/datahub-project/datahub/pull/11180
- fix(ci): additional ci and smoke-test updates by @david-leifker in https://github.com/datahub-project/datahub/pull/11183
- test(smoke-test): minor update to openapi test by @david-leifker in https://github.com/datahub-project/datahub/pull/11184
- feat(ingest): use pre-built dockerize binary by @hsheth2 in https://github.com/datahub-project/datahub/pull/11181
- doc: mark deprecated feature by @anshbansal in https://github.com/datahub-project/datahub/pull/11175
- fix(delete) Fix removing completed/verified forms references by @chriscollins3456 in https://github.com/datahub-project/datahub/pull/11172
- feat(docs): update docs for new release by @RyanHolstien in https://github.com/datahub-project/datahub/pull/11164
- fix(ingest): invalid urn should not fail full batch of changes by @RyanHolstien in https://github.com/datahub-project/datahub/pull/11187
- fix(kafka-setup): add missing script to image by @david-leifker in https://github.com/datahub-project/datahub/pull/11190
- fix(config): fix hash algo config by @david-leifker in https://github.com/datahub-project/datahub/pull/11191
- feat(ingest): allow custom SF API version by @skrydal in https://github.com/datahub-project/datahub/pull/11145
- fix(ingestion/transformer): extend dataset_to_data_product_urns_pattern to support containers by @sagar-salvi-apptware in https://github.com/datahub-project/datahub/pull/11124
- fix(ui) Fix bug with editing entity names by @chriscollins3456 in https://github.com/datahub-project/datahub/pull/11186
- ci(smoke-test): allow smoke-test only PRs by @david-leifker in https://github.com/datahub-project/datahub/pull/11194
- feat(ingestion/lookml): support looker
-- if
comments by @sid-acryl in https://github.com/datahub-project/datahub/pull/11113 - fix(elasticsearch): refactor idHashAlgo setting by @david-leifker in https://github.com/datahub-project/datahub/pull/11193
- fix(ingestion/airflow-plugin): fixed missing inlet/outlets by @dushayntAW in https://github.com/datahub-project/datahub/pull/11101
- docs(readme): add security notes by @david-leifker in https://github.com/datahub-project/datahub/pull/11196
- docs: Update README.md by @prashanthic23 in https://github.com/datahub-project/datahub/pull/11144
- feat(ingest/dbt): skip CLL on sources with
skip_sources_in_lineage
by @hsheth2 in https://github.com/datahub-project/datahub/pull/11195 - fix(graphql): Correct ownership check when removing owners by @pedro93 in https://github.com/datahub-project/datahub/pull/11154
- feat(propagation): UI for rendering propagated column documentation by @jjoyce0510 in https://github.com/datahub-project/datahub/pull/11047
- fix(ui): checks truthy value for last ingested by @pinakipb2 in https://github.com/datahub-project/datahub/pull/10840
- docs(scim): document okta integration with datahub for scim provisioning by @ksrinath in https://github.com/datahub-project/datahub/pull/11120
- fix(ingestion/tableau): Tableau field type parsing by @skrydal in https://github.com/datahub-project/datahub/pull/11202
- feat(analytics): Add page number to SearchResultClickEvent analytics event by @filipe-caetano-ovo in https://github.com/datahub-project/datahub/pull/11151
- fix(graphql) Fix NPE on form actor assignemnt by @chriscollins3456 in https://github.com/datahub-project/datahub/pull/11203
- fix(tests): Bump databricks-sdk dependency to
>=0.30.0
by @skrydal in https://github.com/datahub-project/datahub/pull/11209 - chore(vulnerability): Log Injection (High) by @pinakipb2 in https://github.com/datahub-project/datahub/pull/11131
- feat(ingestion/bigquery): Add ability to filter GCP project ingestion based on project labels by @sid-acryl in https://github.com/datahub-project/datahub/pull/11169
- chore(kafka): kafka version bump by @david-leifker in https://github.com/datahub-project/datahub/pull/11211
- fix(forms) Fix small bug in createForm graphql endpoint by @chriscollins3456 in https://github.com/datahub-project/datahub/pull/11216
- fix(ingestion/lookml): drop
hive.
from CLL by @sid-acryl in https://github.com/datahub-project/datahub/pull/11210 - feat: separate great-expectations action package by @mayurinehate in https://github.com/datahub-project/datahub/pull/11096
- fix(ingest/lookml): support view inheritance for fields by @sid-acryl in https://github.com/datahub-project/datahub/pull/11148
- feat(ingest/mongodb): Ingest databases as containers by @asikowitz in https://github.com/datahub-project/datahub/pull/11178
- fix(ingest/redshift): avoid asserts in redshift schemas by @hsheth2 in https://github.com/datahub-project/datahub/pull/11219
- feat(ingest/snowflake): allow iceberg tables in lineage and access metadata by @alisa-aylward-toast in https://github.com/datahub-project/datahub/pull/10961
- feat(ingestion/looker): filter Looker dashboards by folder by @sid-acryl in https://github.com/datahub-project/datahub/pull/11205
- fix(ingest/sagemaker): ensure consistent STS token usage with refresh mechanism by @sagar-salvi-apptware in https://github.com/datahub-project/datahub/pull/11170
- feat(ingest/s3): Partition support by @treff7es in https://github.com/datahub-project/datahub/pull/11083
- fix: modify the archived version & update code to download only the a… by @yoonhyejin in https://github.com/datahub-project/datahub/pull/11228
- chore(bump): bump hadoop and dnsjava versions by @david-leifker in https://github.com/datahub-project/datahub/pull/11227
- chore(bump): update graphql & spring version by @david-leifker in https://github.com/datahub-project/datahub/pull/11226
- docs(ingest): update config docs on platform instances by @hsheth2 in https://github.com/datahub-project/datahub/pull/11206
- feat(ingest/dbt): add support for urns in add_owner directive by @hsheth2 in https://github.com/datahub-project/datahub/pull/11221
- fix(ingest/snowflake): propagate table list from main to query extractor by @hsheth2 in https://github.com/datahub-project/datahub/pull/11222
- chore(bump): bump kafka base image by @david-leifker in https://github.com/datahub-project/datahub/pull/11236
- fix(datahub-frontend): remove old test creds by @david-leifker in https://github.com/datahub-project/datahub/pull/11237
- docs: Update confluent-cloud.md by @alexandrebunn in https://github.com/datahub-project/datahub/pull/11212
- docs(update): Security stance docs.md by @david-leifker in https://github.com/datahub-project/datahub/pull/11241
- feat(ingest): add bigquery-queries source by @mayurinehate in https://github.com/datahub-project/datahub/pull/10994
- fix(spark-lineage): enable user with editor role to ingest dataProces… by @deepgarg-visa in https://github.com/datahub-project/datahub/pull/11130
- fix(analytics): index description so analytics are correct by @anshbansal in https://github.com/datahub-project/datahub/pull/11224
- chore(bump): bump azure-identity by @david-leifker in https://github.com/datahub-project/datahub/pull/11235
- feat(docs): Update docs on managing user subscriptions by @pedro93 in https://github.com/datahub-project/datahub/pull/11243
- fix(cli/delete): change filter to include env by @anshbansal in https://github.com/datahub-project/datahub/pull/11246
- Rephrase scope of automated scanning by @darnaut in https://github.com/datahub-project/datahub/pull/11248
- docs(urn): Update urn docs with restrictions by @eboneil in https://github.com/datahub-project/datahub/pull/11213
- feat(ingest): add ingestion source for SAP Analytics Cloud by @Masterchen09 in https://github.com/datahub-project/datahub/pull/10958
- feat(ingest/superset): clickhousedb -> clickhouse mapping in superset ingestion by @esselius in https://github.com/datahub-project/datahub/pull/11201
- feat(ingest/bigquery): Add query job retries for transient errors by @feldjay in https://github.com/datahub-project/datahub/pull/11162
- Replacing ant dropdown 'overlay' with 'menu' by @sakethvarma397 in https://github.com/datahub-project/datahub/pull/11229
- fix(spark-lineage): exclude log4j.xml and log4j2.xml from openlineage… by @deepgarg-visa in https://github.com/datahub-project/datahub/pull/11239
- fix(ingest/snowflake): exclude snowflake excluded tags by @alisa-aylward-toast in https://github.com/datahub-project/datahub/pull/11250
- fix(ingest/kafka): update warning reporting for kafka by @hsheth2 in https://github.com/datahub-project/datahub/pull/11171
- chore(vulnerability): Incomplete string escaping or encoding by @pinakipb2 in https://github.com/datahub-project/datahub/pull/11060
- docs: fix great-expectations doc module_name by @mayurinehate in https://github.com/datahub-project/datahub/pull/11253
- feat(ingest/dagster): Add automatic snowflake_pandas_io_manager asset capture by @treff7es in https://github.com/datahub-project/datahub/pull/11189
- chore: update contributor list by @sakethvarma397 in https://github.com/datahub-project/datahub/pull/11257
- fix: Refactoring the antd Modal
visible
property toopen
by @sakethvarma397 in https://github.com/datahub-project/datahub/pull/11232 - ci(build): update outdated action & pin deepdiff lib by @anshbansal in https://github.com/datahub-project/datahub/pull/11260
- feat(ingestion-base): convert to ubuntu image by @david-leifker in https://github.com/datahub-project/datahub/pull/11263
- ci: update outdated actions for java and python setup by @anshbansal in https://github.com/datahub-project/datahub/pull/11261
- chore(platform): Adding Dagster and Prefect platforms by @treff7es in https://github.com/datahub-project/datahub/pull/11264
- fix(ingestion/prefect-plugin): fixed the unit tests by @dushayntAW in https://github.com/datahub-project/datahub/pull/10643
- fix(build/spark): Add explicit dependency to openlineage-converter by @treff7es in https://github.com/datahub-project/datahub/pull/11268
- ci(flavor): reintroduce flavor suffix by @david-leifker in https://github.com/datahub-project/datahub/pull/11265
- feat(ingest/snowflake): Add cluster formula to dataset properties by @alisa-aylward-toast in https://github.com/datahub-project/datahub/pull/11254
- fix(ingestion-base): add missing util by @david-leifker in https://github.com/datahub-project/datahub/pull/11269
- feat(build): remove base-requirements.txt by @hsheth2 in https://github.com/datahub-project/datahub/pull/11238
- build(deps): bump webpack from 5.91.0 to 5.94.0 in /docs-website by @dependabot in https://github.com/datahub-project/datahub/pull/11258
- build(deps): bump micromatch from 4.0.5 to 4.0.8 in /docs-website by @dependabot in https://github.com/datahub-project/datahub/pull/11242
- feat(ingest/s3): Support reading S3 file type by @asikowitz in https://github.com/datahub-project/datahub/pull/11177
- fix(openlineage): fix jar conflict by @david-leifker in https://github.com/datahub-project/datahub/pull/11278
- fix(ingest): limit number of upstreams generated by sql parsing aggre… by @mayurinehate in https://github.com/datahub-project/datahub/pull/11267
- feat(ingest/fivetran): support filtering on destination ids by @matthew-coudert-cko in https://github.com/datahub-project/datahub/pull/11277
- feat(ingest/bq): integrate bigquery-queries into main source by @mayurinehate in https://github.com/datahub-project/datahub/pull/11247
- doc(acryl cloud): release notes for 0.3.5.x by @anshbansal in https://github.com/datahub-project/datahub/pull/11259
- feat(ingest/databricks): include metadata for browse only tables by @mayurinehate in https://github.com/datahub-project/datahub/pull/10766
- fix(docs): fix logout url by @david-leifker in https://github.com/datahub-project/datahub/pull/11294
- feat(ingest): add python deps for
apk
by @hsheth2 in https://github.com/datahub-project/datahub/pull/11188 - fix(ingest/mssql): remove lower() method from sql_common get_db_name by @sleeperdeep in https://github.com/datahub-project/datahub/pull/10773
- feat(graphql): Lazy dataLoaders by @david-leifker in https://github.com/datahub-project/datahub/pull/11293
- fix(bigquery): followups on bigquery queries v2 integration by @mayurinehate in https://github.com/datahub-project/datahub/pull/11291
- fix(ingest): add custom StrEnum type by @hsheth2 in https://github.com/datahub-project/datahub/pull/11270
- feat(schemaField): populate schemaFields with side effects by @david-leifker in https://github.com/datahub-project/datahub/pull/10928
- fix(ingest/prefect): Temporary pinning Prefect 2.x until we can upgrade to 3.x by @treff7es in https://github.com/datahub-project/datahub/pull/11302
- feat(ingest/athena): Add option to disable partition extraction by @treff7es in https://github.com/datahub-project/datahub/pull/11286
- docs(adoption): Add Inter&Co by @ignitz in https://github.com/datahub-project/datahub/pull/11299
- fix(api/timeline): fix corner cases missed, add tests by @anshbansal in https://github.com/datahub-project/datahub/pull/11288
- config(kafka): clean-up kafka serializer config by @david-leifker in https://github.com/datahub-project/datahub/pull/11303
- fix(ingest/protobuf): Improve String Handling for Multilingual Support in Descriptions by @eunseokyang in https://github.com/datahub-project/datahub/pull/10975
- feat(ingest): Support protobuf description for enum field by @eunseokyang in https://github.com/datahub-project/datahub/pull/11027
- fix(search): Search not returning result if query text contains forward slash by @siladitya2 in https://github.com/datahub-project/datahub/pull/10932
- feat(ingest/salesforce): helpful error messages on failure by @mayurinehate in https://github.com/datahub-project/datahub/pull/11266
- fix(search): fix regression from #10932 by @david-leifker in https://github.com/datahub-project/datahub/pull/11309
- chore(vulnerability): Insecure randomness by @pinakipb2 in https://github.com/datahub-project/datahub/pull/11058
- feat(ingest/sql): add default dialect support to SqlQueriesSource by @rohansun in https://github.com/datahub-project/datahub/pull/11285
- fix : added support for multiple values for CONTAIN, START_WITH and END_WITH operators by @Nbagga14 in https://github.com/datahub-project/datahub/pull/11068
- feat(ingest): enable query usage stats by default by @hsheth2 in https://github.com/datahub-project/datahub/pull/11281
- build(deps): bump micromatch from 4.0.5 to 4.0.8 in /datahub-web-react by @dependabot in https://github.com/datahub-project/datahub/pull/11296
- fix(docs): Add correct link for automations by @jjoyce0510 in https://github.com/datahub-project/datahub/pull/11323
- feat(cli): reject missing urns in
datahub get
by @hsheth2 in https://github.com/datahub-project/datahub/pull/11313 - fix(smoke): fix timeseries delete test's usage of
datahub get
by @hsheth2 in https://github.com/datahub-project/datahub/pull/11330 - feat(ingest): make rest emitter version error messages more clear by @hsheth2 in https://github.com/datahub-project/datahub/pull/11295
- docs(ingest/dbt): clarify dbt ingestion docs by @hsheth2 in https://github.com/datahub-project/datahub/pull/11312
- fix(py): fix issues with AvroException by @hsheth2 in https://github.com/datahub-project/datahub/pull/11311
- fix(ingestion/tableau): restructure the tableau graphql datasource query by @sid-acryl in https://github.com/datahub-project/datahub/pull/11230
- fix(ingest): disable reporting for dry-run pipelines by @hsheth2 in https://github.com/datahub-project/datahub/pull/11306
- feat(ingest): support full urns without owner_type in meta mapping by @hsheth2 in https://github.com/datahub-project/datahub/pull/11298
- feat(ingest/sql): auto extract and use mode query user metadata by @mayurinehate in https://github.com/datahub-project/datahub/pull/11307
- fix(version): forUpdate needed for versioning by @david-leifker in https://github.com/datahub-project/datahub/pull/11328
- fix(ingest): avoid sqlite "too many SQL variables" error by @hsheth2 in https://github.com/datahub-project/datahub/pull/11332
- chore(ingest): bump acryl-sqlglot by @hsheth2 in https://github.com/datahub-project/datahub/pull/11331
- docs(oidc): document azure logout uri by @david-leifker in https://github.com/datahub-project/datahub/pull/11344
- feat(logging): add option to log slow GraphQL queries by @nmbryant in https://github.com/datahub-project/datahub/pull/11308
- docs(ingest/dbt): add docs on hiding sources by @hsheth2 in https://github.com/datahub-project/datahub/pull/11334
- feat(mode/ingest): Add support for missing Mode datasets in lineage by @sagar-salvi-apptware in https://github.com/datahub-project/datahub/pull/11290
- feat(entity-service): fallback logic for aspect version by @david-leifker in https://github.com/datahub-project/datahub/pull/11304
- fix(ingest/bq): fix ordering of queries for use_queries_v2 by @mayurinehate in https://github.com/datahub-project/datahub/pull/11333
- docs(updating-datahub) Bump minor version on v0.14.0 notes by @maggiehays in https://github.com/datahub-project/datahub/pull/11255
- docs(data product): Update example and docs by @eboneil in https://github.com/datahub-project/datahub/pull/11032
- feat(ingest): maintain ordering in file-backed dict by @hsheth2 in https://github.com/datahub-project/datahub/pull/11346
- docs: add signup form in cloud by @yoonhyejin in https://github.com/datahub-project/datahub/pull/11129
- config(retention): update dataHubExecutionRequestResult by @david-leifker in https://github.com/datahub-project/datahub/pull/11348
- feat(grafana): Using v2 metrics update datahub dashboard by @AaronYang0628 in https://github.com/datahub-project/datahub/pull/11208
- chore(links): add attribution by @shirshanka in https://github.com/datahub-project/datahub/pull/11352
- fix(timeline api): adding modification category by @sakethvarma397 in https://github.com/datahub-project/datahub/pull/11345
- Feature/custom highlight on search by @arpanchakra29 in https://github.com/datahub-project/datahub/pull/11339
- fix(gms): filter out runs of a dataJob without any run-events by @ksrinath in https://github.com/datahub-project/datahub/pull/11223
- fix(ingest): followup on bigquery queries v2 ordering by @mayurinehate in https://github.com/datahub-project/datahub/pull/11353
- fix(ingest/databricks): use latest report message format for warning messages by @sid-acryl in https://github.com/datahub-project/datahub/pull/11319
- chore(ingest): improve code formatting by @hsheth2 in https://github.com/datahub-project/datahub/pull/11326
- chore(py): cleanup python CI by @hsheth2 in https://github.com/datahub-project/datahub/pull/11324
- feat(auth): implement session authorization cache by @david-leifker in https://github.com/datahub-project/datahub/pull/11327
- feat(search): search query rewriter by @david-leifker in https://github.com/datahub-project/datahub/pull/11279
- feat(openapi-v3): add additional delete options by @david-leifker in https://github.com/datahub-project/datahub/pull/11347
- perf(search): reduce highlight fragments by @david-leifker in https://github.com/datahub-project/datahub/pull/11349
- feat(throttle): extend throttling to API requests by @david-leifker in https://github.com/datahub-project/datahub/pull/11325
- fix(browse): adjust browse to use full text in line with search by @RyanHolstien in https://github.com/datahub-project/datahub/pull/11367
- Fix: bug fix for empty key values pair in elastic search mapping by @milindgupta9 in https://github.com/datahub-project/datahub/pull/11004
- feat(ingest): make default rest sink mode env-configurable by @hsheth2 in https://github.com/datahub-project/datahub/pull/11335
- feat: add acryl stories by @yoonhyejin in https://github.com/datahub-project/datahub/pull/11351
- fix: add cloud form & fix css by @yoonhyejin in https://github.com/datahub-project/datahub/pull/11362
- fix(NPE): fix NPE in EntityService by @david-leifker in https://github.com/datahub-project/datahub/pull/11373
- feat(ingest/dbt): add
only_include_if_in_catalog
flag for dbt core by @hsheth2 in https://github.com/datahub-project/datahub/pull/11314 - chore(actions): bump actions version in docker profiles by @david-leifker in https://github.com/datahub-project/datahub/pull/11377
- fix(ingest/nifi): add error handling for version by @anshbansal in https://github.com/datahub-project/datahub/pull/11385
- fix(XServiceProvider): fix ebean framework race condition by @david-leifker in https://github.com/datahub-project/datahub/pull/11378
- fix(docs): clarify clean-up of indices when restoring search and graph indices by @Masterchen09 in https://github.com/datahub-project/datahub/pull/11380
- feat(ingest): report ingest run for sample data by @hsheth2 in https://github.com/datahub-project/datahub/pull/11329
- fix(ebean): upgrade ebean library by @david-leifker in https://github.com/datahub-project/datahub/pull/11379
- fix(ingest/snowflake): Update snowflake_utils.py to account for iceberg tables by @alisa-aylward-toast in https://github.com/datahub-project/datahub/pull/11384
- feat(ingest): default to ASYNC_BATCH mode in datahub-rest sink by @hsheth2 in https://github.com/datahub-project/datahub/pull/11369
- feat(graphql): Support START_WITH and END_WITH operator in GraphQL API by @milindgupta in https://github.com/datahub-project/datahub/pull/11026
- fix: support for non-string types in object fields by @vignesh-hbk in https://github.com/datahub-project/datahub/pull/11066
- refactor(search): refactor field type detection by @david-leifker in https://github.com/datahub-project/datahub/pull/11395
New Contributors
- @prashanthic23 made their first contribution in https://github.com/datahub-project/datahub/pull/11144
- @alisa-aylward-toast made their first contribution in https://github.com/datahub-project/datahub/pull/10961
- @alexandrebunn made their first contribution in https://github.com/datahub-project/datahub/pull/11212
- @esselius made their first contribution in https://github.com/datahub-project/datahub/pull/11201
- @sakethvarma397 made their first contribution in https://github.com/datahub-project/datahub/pull/11229
- @ignitz made their first contribution in https://github.com/datahub-project/datahub/pull/11299
- @eunseokyang made their first contribution in https://github.com/datahub-project/datahub/pull/10975
- @rohansun made their first contribution in https://github.com/datahub-project/datahub/pull/11285
- @Nbagga14 made their first contribution in https://github.com/datahub-project/datahub/pull/11068
- @AaronYang0628 made their first contribution in https://github.com/datahub-project/datahub/pull/11208
- @arpanchakra29 made their first contribution in https://github.com/datahub-project/datahub/pull/11339
- @milindgupta9 made their first contribution in https://github.com/datahub-project/datahub/pull/11004
- @milindgupta made their first contribution in https://github.com/datahub-project/datahub/pull/11026
- @vignesh-hbk made their first contribution in https://github.com/datahub-project/datahub/pull/11066
Full Changelog: https://github.com/datahub-project/datahub/compare/v0.14.0.2...v0.14.1
v0.14.0.2
Released on 2024-08-21 by @RyanHolstien.
View the release notes for v0.14.0.2 on GitHub.
v0.14.0
Released on 2024-08-13 by @RyanHolstien.
View the release notes for v0.14.0 on GitHub.
v0.13.3
Released on 2024-05-23 by @david-leifker.
View the release notes for v0.13.3 on GitHub.
v0.13.2
Released on 2024-04-16 by @david-leifker.
View the release notes for v0.13.2 on GitHub.
v0.13.1
Released on 2024-04-02 by @david-leifker.
View the release notes for v0.13.1 on GitHub.
v0.13.0
Released on 2024-02-29 by @RyanHolstien.
View the release notes for v0.13.0 on GitHub.
DataHub v0.12.1
Released on 2023-12-08 by @david-leifker.
View the release notes for DataHub v0.12.1 on GitHub.
v0.12.1rc2
Released on 2023-11-28 by @david-leifker.
View the release notes for v0.12.1rc2 on GitHub.
v0.12.0
Released on 2023-10-25 by @pedro93.
View the release notes for v0.12.0 on GitHub.
v0.11.0
Released on 2023-09-08 by @iprentic.
View the release notes for v0.11.0 on GitHub.
v0.10.5
Released on 2023-08-02 by @david-leifker.
View the release notes for v0.10.5 on GitHub.
v0.10.4
Released on 2023-06-09 by @pedro93.
View the release notes for v0.10.4 on GitHub.
v0.10.3
Released on 2023-05-25 by @iprentic.
View the release notes for v0.10.3 on GitHub.
DataHub v0.10.2
Released on 2023-04-13 by @iprentic.
View the release notes for DataHub v0.10.2 on GitHub.
DataHub v0.10.1
Released on 2023-03-23 by @aditya-radhakrishnan.
View the release notes for DataHub v0.10.1 on GitHub.
DataHub v0.10.0
Released on 2023-02-07 by @david-leifker.
View the release notes for DataHub v0.10.0 on GitHub.
DataHub v0.9.6.1
Released on 2023-01-31 by @david-leifker.
View the release notes for DataHub v0.9.6.1 on GitHub.
DataHub v0.9.6
Released on 2023-01-13 by @maggiehays.
View the release notes for DataHub v0.9.6 on GitHub.
DataHub v0.9.5
Released on 2022-12-23 by @jjoyce0510.
View the release notes for DataHub v0.9.5 on GitHub.
[Known Issues] DataHub v0.9.4
Released on 2022-12-20 by @maggiehays.
View the release notes for [Known Issues] DataHub v0.9.4 on GitHub.
DataHub v0.9.3
Released on 2022-11-30 by @maggiehays.
View the release notes for DataHub v0.9.3 on GitHub.
DataHub v0.9.2
Released on 2022-11-04 by @maggiehays.
View the release notes for DataHub v0.9.2 on GitHub.
DataHub v0.9.1
Released on 2022-10-31 by @maggiehays.
View the release notes for DataHub v0.9.1 on GitHub.
DataHub v0.9.0
Released on 2022-10-11 by @szalai1.
View the release notes for DataHub v0.9.0 on GitHub.
DataHub v0.8.45
Released on 2022-09-23 by @gabe-lyons.
View the release notes for DataHub v0.8.45 on GitHub.
DataHub v0.8.44
Released on 2022-09-01 by @jjoyce0510.
View the release notes for DataHub v0.8.44 on GitHub.
DataHub v0.8.43
Released on 2022-08-09 by @maggiehays.
View the release notes for DataHub v0.8.43 on GitHub.