Document

● Which AWS services have you used?

● What is Amazon EC2, and how is it used?

● What are the different instance types in EC2, and how do they differ?

● Explain what Amazon EMR is and its use cases.

● Why have you used EMR?

● What is the difference between Hadoop and Amazon EMR?

● What is Amazon S3, and how is it used for object storage?

● How can you secure data stored in Amazon S3?

● Explain the S3 storage classes and their use cases.

● How can you monitor and manage costs in AWS?

● Features of Cloud?

● Benefits of RDS over on-premise RDBMS?

● What is Amazon Redshift, and how does it differ from traditional relational databases?

● Explain the concept of data warehousing and how Redshift fits into it?

● How can you optimize query performance in Amazon Redshift?

● How can you monitor and manage costs in AWS?

● Changes dependent on Athena?

● Query execution engine in Athena?

● Max size of file upload in S3?

● "What is AWS Glue, and how does it fit into the AWS ecosystem? "

● How does AWS Glue simplify the ETL process for big data?

● What are AWS Glue Jobs and how do they work?

● Can you explain the Glue Data Catalog and its significance?

● How does AWS Glue integrate with other AWS services, such as S3 and Redshift?

● "Where can you schedule Glue Jobs? "

● Where can you check the logs of Glue Jobs?

● What is Amazon Athena, and how does it query data stored in Amazon S3?

● What is Amazon Hadoop distribution?

● What is the default Engine for Athena in AWS

● what is IAM?

● what is VPC? subnet?

● route table , inbound outbound rules?

● s3 storage classes? when we use standard?

● local to EC2 connection?

● what is glacier?

● what is normalization and denormalization?

● when we go for normalization ?

● SCD types?

● OLTP & OLAP difference?

● Star and snowflake schema ?

● what is fact & dimension tables and their relation?

● how they are co-related to each other Fact and Dimension tables?

AWS