Commit af5d4434 authored by Tao Feng, committed by GitHub

doc: add a faq entry on create_only config (#696)

Signed-off-by: Tao Feng <fengtao04@gmail.com>
parent 431a279f
......@@ -33,3 +33,15 @@ The data in the metadata store, however, can be preserved when migrating from 1.
A v2.0 deployment consists of deploying all three services and republishing the Elasticsearch table documents with the v2.0 Databuilder.
Keep in mind there is likely to be some downtime with v2.0.0, between deploying the 3 services and re-seeding the Elasticsearch indexes, so it might be ideal to stage a rollout by datacenter/environment if uptime is key.
## How to prevent certain metadata in Amundsen from being erased by databuilder ingestion?
By default, databuilder always upserts the metadata. If you want to prevent that from happening for certain types of metadata, you can add the following
entry to your databuilder job's config:
```python
'publisher.neo4j.{}'.format(neo4j_csv_publisher.NEO4J_CREATE_ONLY_NODES): [DESCRIPTION_NODE_LABEL],
```
With this config, databuilder writes a table or column description only if one does not already exist, for example when the table is newly created.
This is useful when we treat the Amundsen graph as the source of truth for certain types of metadata (e.g. descriptions).
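
The difference between the default upsert and the create-only behaviour can be sketched with a toy in-memory model. This is only an illustration of the semantics described above, not databuilder's implementation: the `publish` function, the `create_only_labels` parameter, the `DESCRIPTION_LABEL` constant, and the node keys are all hypothetical stand-ins.

```python
import copy

# Hypothetical stand-in for DESCRIPTION_NODE_LABEL; the real value may differ.
DESCRIPTION_LABEL = "Description"


def publish(graph, nodes, create_only_labels=()):
    """Toy publisher: upsert every node, except that nodes whose label is in
    create_only_labels are written only when their key is not in the graph yet."""
    result = copy.deepcopy(graph)
    for node in nodes:
        key = node["key"]
        if node["label"] in create_only_labels and key in result:
            continue  # node already exists: keep what is stored in the graph
        result[key] = dict(node)  # default behaviour: create or overwrite
    return result


# Existing graph: a description that a user edited in Amundsen.
graph = {
    "db.tbl/_description": {
        "key": "db.tbl/_description",
        "label": DESCRIPTION_LABEL,
        "description": "edited in the UI",
    },
}

# Incoming ingestion: one description that already exists, one for a new table.
incoming = [
    {"key": "db.tbl/_description", "label": DESCRIPTION_LABEL,
     "description": "from the source system"},
    {"key": "db.new_tbl/_description", "label": DESCRIPTION_LABEL,
     "description": "new table description"},
]

upserted = publish(graph, incoming)
preserved = publish(graph, incoming, create_only_labels=[DESCRIPTION_LABEL])
```

In this sketch, `upserted` overwrites the user-edited description, while `preserved` keeps it and still creates the description for the new table.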
\ No newline at end of file
......@@ -69,6 +69,7 @@ nav:
- 'How to setup a preview client with Apache Superset': 'tutorials/data-preview-with-superset.md'
- 'How to setup user profiles': 'tutorials/user-profiles.md'
- 'How to ingest Dashboard': 'databuilder/docs/dashboard_ingestion_guide.md'
- 'How to track user metric for Amundsen': 'tutorials/how-to-track-user-metric.md'
- 'Deployment':
- 'Authentication': 'authentication/oidc.md'
- 'AWS ECS Installation': 'installation-aws-ecs/aws-ecs-deployment.md'
......