Skip to content
Projects
Groups
Snippets
Help
Loading...
Help
Submit feedback
Contribute to GitLab
Sign in
Toggle navigation
A
AmendsenProject
Project
Project
Details
Activity
Releases
Cycle Analytics
Repository
Repository
Files
Commits
Branches
Tags
Contributors
Graph
Compare
Charts
Issues
0
Issues
0
List
Board
Labels
Milestones
Merge Requests
0
Merge Requests
0
CI / CD
CI / CD
Pipelines
Jobs
Schedules
Charts
Wiki
Wiki
Snippets
Snippets
Members
Members
Collapse sidebar
Close sidebar
Activity
Graph
Charts
Create a new issue
Jobs
Commits
Issue Boards
Open sidebar
Shaik Janipasha
AmendsenProject
Commits
e57e5c46
Unverified
Commit
e57e5c46
authored
Apr 13, 2020
by
Craig Rosie
Committed by
GitHub
Apr 13, 2020
Browse files
Options
Browse Files
Download
Email Patches
Plain Diff
Fix docs/models.md headings (#240)
parent
81305061
Changes
1
Hide whitespace changes
Inline
Side-by-side
Showing
1 changed file
with
9 additions
and
9 deletions
+9
-9
models.md
docs/models.md
+9
-9
No files found.
docs/models.md
View file @
e57e5c46
...
@@ -17,12 +17,12 @@ without developers needing to know the internals of the neo4j schema.
...
@@ -17,12 +17,12 @@ without developers needing to know the internals of the neo4j schema.
## The Models
## The Models
###TableMetadata
###
TableMetadata
[
python class
](
../databuilder/models/table_metadata.py
)
[
python class
](
../databuilder/models/table_metadata.py
)
*What datasets does my org have?*
*What datasets does my org have?*
####Description
####
Description
This corresponds to a dataset in amundsen and is the core building block.
This corresponds to a dataset in amundsen and is the core building block.
In addition to ColumnMetadata, tableMetadata is one of the first datasets you should extract as
In addition to ColumnMetadata, tableMetadata is one of the first datasets you should extract as
almost everything else depends on these being populated.
almost everything else depends on these being populated.
...
@@ -32,13 +32,13 @@ In general, for Table and Column Metadata, you should be able to use one of the
...
@@ -32,13 +32,13 @@ In general, for Table and Column Metadata, you should be able to use one of the
in the
[
extractor package
](
../databuilder/extractor
)
in the
[
extractor package
](
../databuilder/extractor
)
### Watermark
### Watermark
[
python class
](
../databuilder/models/watermark.py
)
[
python class
](
../databuilder/models/watermark.py
)
*What is the earliest data that this table has? What is the latest data?*
*What is the earliest data that this table has? What is the latest data?*
This is NOT the same as when the data was last updated.
This is NOT the same as when the data was last updated.
####Description
####
Description
Corresponds to the earliest and latest date that a dataset has. Only makes
Corresponds to the earliest and latest date that a dataset has. Only makes
sense if the dataset is timeseries data.
sense if the dataset is timeseries data.
For example, a given table may have data from 2019/01/01 -> 2020/01/01
For example, a given table may have data from 2019/01/01 -> 2020/01/01
...
@@ -55,14 +55,14 @@ Depending on the datastore of your dataset, you would extract this by:
...
@@ -55,14 +55,14 @@ Depending on the datastore of your dataset, you would extract this by:
*How many queries is a given column getting? By which users?*
*How many queries is a given column getting? By which users?*
####Description
Has query counts per a given column per a user. This can help identify
Has query counts per a given column per a user. This can help identify
#### Description
who uses given datasets so people can contact them if they have questions
who uses given datasets so people can contact them if they have questions
on how to use a given dataset or if a dataset is changing. It is also used as a
on how to use a given dataset or if a dataset is changing. It is also used as a
search boost so that the most used tables are put to the top of the search results.
search boost so that the most used tables are put to the top of the search results.
####Extraction
For more traditional databases, there should be system tables where you can obtain
For more traditional databases, there should be system tables where you can obtain
#### Extraction
these sorts of usage statistics.
these sorts of usage statistics.
In other cases, you may need to use audit logs which could require a custom solution.
In other cases, you may need to use audit logs which could require a custom solution.
...
@@ -75,11 +75,11 @@ on a heuristic.
...
@@ -75,11 +75,11 @@ on a heuristic.
*What users are there out there? Which team is this user on?*
*What users are there out there? Which team is this user on?*
####Description
####
Description
Represents all of the metadata for a user at your company.
Represents all of the metadata for a user at your company.
This is required if you are going to be having authentication turned on.
This is required if you are going to be having authentication turned on.
####Extraction
####
Extraction
TODO
TODO
### TableColumnStats
### TableColumnStats
...
@@ -94,7 +94,7 @@ customizable.
...
@@ -94,7 +94,7 @@ customizable.
It also will probably require a distributed cluster in order to calculate these regularly and in general is
It also will probably require a distributed cluster in order to calculate these regularly and in general is
probably the least accessible metrics to get at without a custom solution.
probably the least accessible metrics to get at without a custom solution.
####Extraction
####
Extraction
The idea here would be to implement something that does the following:
The idea here would be to implement something that does the following:
For each table you care about:
For each table you care about:
For each column you care about:
For each column you care about:
...
...
Write
Preview
Markdown
is supported
0%
Try again
or
attach a new file
Attach a file
Cancel
You are about to add
0
people
to the discussion. Proceed with caution.
Finish editing this message first!
Cancel
Please
register
or
sign in
to comment