● Which AWS services have you used?
● What is Amazon EC2, and how is it used?
● What are the different instance types in EC2, and how do they differ?
● Explain what Amazon EMR is and its use cases.
● Why have you used EMR?
● What is the difference between Hadoop and Amazon EMR?
● What is Amazon S3, and how is it used for object storage?
● How can you secure data stored in Amazon S3?
● Explain the S3 storage classes and their use cases.
● How can you monitor and manage costs in AWS?
● Features of Cloud?
● Benefits of RDS over on-premise RDBMS?
● What is Amazon Redshift, and how does it differ from traditional relational databases?
● Explain the concept of data warehousing and how Redshift fits into it?
● How can you optimize query performance in Amazon Redshift?
● How can you monitor and manage costs in AWS?
● Changes dependent on Athena?
● Query execution engine in Athena?
● Max size of file upload in S3?
● "What is AWS Glue, and how does it fit into the AWS ecosystem? "
● How does AWS Glue simplify the ETL process for big data?
● What are AWS Glue Jobs and how do they work?
● Can you explain the Glue Data Catalog and its significance?
● How does AWS Glue integrate with other AWS services, such as S3 and Redshift?
● "Where can you schedule Glue Jobs? "
● Where can you check the logs of Glue Jobs?
● What is Amazon Athena, and how does it query data stored in Amazon S3?
● What is Amazon Hadoop distribution?
● What is the default Engine for Athena in AWS
● what is IAM?
● what is VPC? subnet?
● route table , inbound outbound rules?
● s3 storage classes? when we use standard?
● local to EC2 connection?
● what is glacier?
● what is normalization and denormalization?
● when we go for normalization ?
● SCD types?
● OLTP & OLAP difference?
● Star and snowflake schema ?
● what is fact & dimension tables and their relation?
● how they are co-related to each other Fact and Dimension tables?