O`Reilly 的 《 Architecting Data Lakes Data Management Architectures for Advanced Business Use Cases 》,全面介绍了数据湖的构架、工作机理、构建与管理、规划、价值、展望等诸多方面的内容。

其目录如下:
1. Overview
What Is a Data Lake?
Data Management and Governance in the Data Lake
How to Deploy a Data Lake Management Platform
2. How Data Lakes Work
Four Basic Functions of a Data Lake
Management and Monitoring
3. Challenges and Complications
Challenges of Building a Data Lake
Challenges of Managing the Data Lake
Deriving Value from the Data Lake
4. Curating the Data Lake
Data Governance
Data Acquisition
Data Organization
Capturing Metadata
Data Preparation
Data Provisioning
Benefits of an Automated Approach
5. Deriving Value from the Data Lake
Self-Service
Controlling and Allowing Access
Using a Bottom-Up Approach to Data Governance to Rank Data Sets
Data Lakes in Different Industries
6. Looking Ahead
Ground-to-Cloud Deployment Options
Looking Beyond Hadoop: Logical Data Lakes
Federated Queries
Data Discovery Portals
In Conclusion
A Checklist for Success
完整内容,可以在此下载:http://www.oreilly.com/data/free/architecting-data-lakes.csp?intcmp=il-data-free-lp-lgen_free_reports_page
也可以随时Email:Hiweb@Outlook.com 沟通探讨。
另外有需要云服务器可以了解下创新互联scvps.cn,海内外云服务器15元起步,三天无理由+7*72小时售后在线,公司持有idc许可证,提供“云服务器、裸金属服务器、高防服务器、香港服务器、美国服务器、虚拟主机、免备案服务器”等云主机租用服务以及企业上云的综合解决方案,具有“安全稳定、简单易用、服务可用性高、性价比高”等特点与优势,专为企业上云打造定制,能够满足用户丰富、多元化的应用场景需求。