Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Alignment data export #409

Closed
wants to merge 31 commits into from
Closed
Changes from 1 commit
Commits
Show all changes
31 commits
Select commit Hold shift + click to select a range
e7667b8
Add sign export script
fsimonjetz Apr 3, 2023
e9894f8
add pandas to dependencies for export script
fsimonjetz Apr 3, 2023
6ec8941
Remove debug limit
fsimonjetz Apr 4, 2023
6f6ea7a
'Refactored by Sourcery' (#410)
sourcery-ai[bot] Apr 4, 2023
30fb091
Refactoring
fsimonjetz Apr 4, 2023
7064a5b
export template
fsimonjetz Apr 4, 2023
40a1731
add caic reports io
fsimonjetz Apr 5, 2023
7efa57e
fix reference aggregation
fsimonjetz Apr 5, 2023
f1e74fd
Move team names into environment variable
fsimonjetz Apr 5, 2023
23fc057
Fix monthly index
fsimonjetz Apr 5, 2023
9ba5f42
Less restrictive field matching
fsimonjetz Apr 5, 2023
58dde97
better references display
fsimonjetz Apr 13, 2023
77b5d98
Merge branch 'master' into alignment-data-export
fsimonjetz Apr 13, 2023
6be4274
Refactoring; add zip to export
fsimonjetz Apr 13, 2023
76f487e
Ignore output files
fsimonjetz Apr 13, 2023
c7ef43d
Use tar.gz instead of zip
fsimonjetz Apr 13, 2023
9ff50e7
Merge branch 'extend-revision-record' into alignment-data-export
fsimonjetz Apr 14, 2023
d635c93
Remove unrelated changes
fsimonjetz Apr 14, 2023
2a8037a
remove vocab generation
fsimonjetz Apr 17, 2023
b55a3cb
Remove old export script
fsimonjetz Apr 21, 2023
9d1eb91
include more chapter infos
fsimonjetz Apr 25, 2023
674e83c
Remove filtering "empty" texts
fsimonjetz Apr 25, 2023
b12ba58
use stage abbreviation instead of long name;
fsimonjetz Apr 25, 2023
bd0a703
Fix dtypes; add url column
fsimonjetz May 10, 2023
e9ad32f
include colophon and unplaced line counts
fsimonjetz Jun 1, 2023
d7c4eae
Always use the production db for export
fsimonjetz Jun 1, 2023
a9456c6
add colophon option
fsimonjetz Jun 2, 2023
1caa714
Merge branch 'master' into alignment-data-export
fsimonjetz Jun 2, 2023
82b55ab
add sign export task
fsimonjetz Jun 2, 2023
41b6c36
reformatting
fsimonjetz Jun 2, 2023
044cf3b
Merge branch 'master' into alignment-data-export
fsimonjetz Jun 2, 2023
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Prev Previous commit
Next Next commit
Always use the production db for export
fsimonjetz committed Jun 1, 2023
commit d7c4eaefe2ce3a13d01acebf96484c6ce534e8e9
3 changes: 1 addition & 2 deletions ebl/io/alignment/data_export.py
Original file line number Diff line number Diff line change
@@ -22,8 +22,7 @@
pd.options.mode.chained_assignment = None

client = MongoClient(os.environ["MONGODB_URI"])
DB = os.environ.get("MONGODB_DB")
database = client.get_database(DB)
database = client.get_database("ebl")
fragments = MongoCollection(database, FRAGMENTS_COLLECTION)
chapters = MongoCollection(database, CHAPTERS_COLLECTION)