Data Engineering Handbook Table of Contents #3

Open
Guo-Zhang opened this issue Feb 19, 2024 · 2 comments

Comments

@Guo-Zhang
Member

Guo-Zhang commented Feb 19, 2024

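Part Methods and Tools
Chapter Basic Principles
Chapter Infrastructure
Section Metadata Management Platform

Part Data Deliverables
Chapter Datasets
Chapter Data Applications

Part Data Lifecycle
Chapter Data Collection
Section Web Crawlers
Subsection AI Crawlers

Part System Governance
Part R&D Process Management
Part Data Sharing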

@Guo-Zhang
Member Author

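On web crawlers: for now, the plan is to organize the material in order, with traditional crawlers first and AI crawlers second. If AI crawlers succeed in superseding the traditional approach, we can build our crawler best practices around AI crawlers.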

@Guo-Zhang
Member Author

If we want to align with the latest data engineering standards, "Methods and Tools" could be merged into "System Governance".
