Instructor: Peter Princen, Sr. Product Manager
Course Objectives:
- Explain onboarding ERP cloud data into data lake built on AWS S3
- Utilize Amazon Athena to access data in AWS S3 data lake
- Examine complete lineage of Tableau workbook and source systems
In this course, we will review a user journey of a business analyst that needs to make a report on sales forecasts in the domain of supply chain. The use case will show that not only Catalog will be used to find the correct data, but it will actually manage the whole process of looking for data, requesting new data to be onboarded into the data lake where our analyst can actually access it, and leading up to creating a report in a BI tool of choice. In our example the BI tool will Tableau, and the data lake will be built on AWS S3.
At the end of our journey, the analyst can access the data that is stored on AWS S3. He can do this by using the keys that have been provided to him by Collibra Catalog. To use these keys access to data stored on S3, the Athena service is used. His BI tool, Tableau, supports the Athena service, so he can select Athena in the list of available sources, and enter the access information that Collibra Catalog provided him with. This way it’s very easy, and very secure for him to access this data. Finally, the Tableau report that is generated by our analyst will be automatically ingested into Collibra Catalog, so it is available for other analysts to satisfy their reporting needs.