data-catalog-ddl-generator

This repository is to help engineers to generate table ddl from

A specific AWS Glue Database & Table name
A specific AWS Glue Database
All Tables in AWS Glue Data Catalog

To run this on your local machine, you will need to have to configure aws credential on your local computer.

Dependency

Please setup AWS CLI Configuration on your local machine.

Start

python generate_specific_table_ddl.py

Information Input

> Enter Database Name temp
> Enter Table Name temp
> Enter Query Output Bucket Name Athena Query Output S3 Bucket

Results

Execution ID: bla-bla-bla
QUEUED
SUCCEEDED
Query "SHOW CREATE TABLE temp.temp;" finished.
CREATE EXTERNAL TABLE `temp.temp`(
  `temp0` string COMMENT 'temp0',
  `temp1` bigint COMMENT 'temp1',
  `temp2` string COMMENT 'temp2')
PARTITIONED BY (
  `dt` string COMMENT 'temp partition')
ROW FORMAT SERDE
  'org.apache.hadoop.hive.ql.io.parquet.serde.ParquetHiveSerDe'
STORED AS INPUTFORMAT
  'org.apache.hadoop.hive.ql.io.parquet.MapredParquetInputFormat'
OUTPUTFORMAT
  'org.apache.hadoop.hive.ql.io.parquet.MapredParquetOutputFormat'
LOCATION
  's3://temp/temp'
TBLPROPERTIES (
  'classification'='parquet',
  'has_encrypted_data'='false',
  'parquet.compress'='SNAPPY')

Name		Name	Last commit message	Last commit date
Latest commit History 5 Commits
README.md		README.md
generate_all_tables_ddl.py		generate_all_tables_ddl.py
generate_specific_database_tables_ddl.py		generate_specific_database_tables_ddl.py
generate_specific_table_ddl.py		generate_specific_table_ddl.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

data-catalog-ddl-generator

Dependency

Start

Information Input

Results

About

Releases

Packages

Languages

jensenity/data-catalog-ddl-generator

Folders and files

Latest commit

History

Repository files navigation

data-catalog-ddl-generator

Dependency

Start

Information Input

Results

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages