7/27/2023 0 Comments Aws redshift emr msk![]() Its transforms, what can be done with them, how to optimize AWS Glue ETL jobs, know the limits and use cases for AWS Glue Crawlers, AWS Glue Data Catalog and its compatibility with Hive Metastore. If we are on the topic of Processing, pay attention to AWS Glue. You need to have experience with error handling, Amazon Kinesis Producer Library (KPL), Amazon Kinesis Consumer Library (KCL), and you need to know where and how you can use the Random Cut Forest (RCF) algorithm. Pay attention to which services and integrations provide near real-time processing and allow for processing data in an exactly-once manner. ![]() You need to have in-depth knowledge of the limits, relevant data sources, data targets, and delivery guarantees for all Amazon Kinesis services (excluding Amazon Kinesis Video Streams mentioned above). Speaking about Amazon Kinesis - this exam is very Kinesis-heavy. What was there and actually surprised me is a significant amount of questions about AWS Lambda, including consumers for the Amazon Kinesis suite or just for processing data from Amazon S3. The same goes for theory, definitions, AWS Well-Architected Framework. There was no question about Amazon Kinesis Video Streams (however, wait for it as I will explain it later), nor AWS IoT. If there were any, they were pretty basic and well-documented - mostly about EMRFS and the purpose of Apache Hive and Hive Metastore. There were no queries, no SQL, no code examples to analyze (even the smallest bit of IAM policy, nor AWS CloudFormation template).Īs I told you above, there were almost no questions about Hadoop and its internals. Let's start with what surprised me and wasn't available for my exam. □Īs we have the usual stuff covered, let's discuss my thoughts regarding the exam content and coverage. So make sure your bullshit-o-meter is well calibrated.
0 Comments
Leave a Reply. |