What is Amazon Athena?
Amazon describes Athena as a serverless interactive query service. Its entire design is built around letting you quickly analyze large quantities of data stored in Amazon S3 using standard SQL. Athena reads the data directly from S3 without setting up any servers, frameworks, clusters, or other tools; the only prerequisite is getting the data loaded into S3.
— Rapid query results without having to tune queries or optimize database structures.
— Amazon S3 stores the data, so businesses do not need to invest in physical IT infrastructure to store and query their information.
— Easy to use: point to your data in Amazon S3, define the schema, and start querying using standard SQL.
— No need for complex ETL (extract, transform, load) jobs to prepare your data for analysis, which makes it easy for anyone with SQL skills to quickly analyze large-scale datasets.
— Query all your data in S3 without having to set up and manage any servers or data warehouses.
— Amazon S3 is the underlying data store, making your data highly available and durable.
— Suited to quick, ad hoc querying, but it can also handle complex analysis, including large joins, window functions, and arrays.
— Athena itself is highly available.
— There is no infrastructure to manage, and you pay only for the queries that you run.
Athena stands out as easy to use for companies big and small, especially if they already use or plan to use Amazon S3.
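The "point to your data, define the schema, start querying" workflow described above can be sketched in Python as a small helper that builds the two SQL statements involved. The table name, columns, and bucket path are hypothetical, and the DDL assumes plain comma-separated files; other formats would need a different ROW FORMAT or SerDe clause.

```python
def build_create_table(table, columns, s3_location):
    """Build DDL that maps CSV files under an S3 prefix to a table schema."""
    column_lines = ",\n".join(f"  `{name}` {sql_type}" for name, sql_type in columns)
    return (
        f"CREATE EXTERNAL TABLE IF NOT EXISTS {table} (\n"
        f"{column_lines}\n"
        ")\n"
        "ROW FORMAT DELIMITED FIELDS TERMINATED BY ','\n"
        f"LOCATION '{s3_location}'"
    )


# Hypothetical table, columns, and bucket, for illustration only.
ddl = build_create_table(
    "web_logs",
    [("request_time", "string"), ("status", "int"), ("bytes_sent", "bigint")],
    "s3://example-bucket/web-logs/",
)

# Once the table exists, ordinary SQL runs directly against the files in S3.
query = "SELECT status, COUNT(*) AS hits FROM web_logs GROUP BY status ORDER BY hits DESC"

print(ddl)
print(query)
```

The table is only metadata: the files stay where they are in S3, and dropping the table does not delete them.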
How Amazon Athena Works
— You don't want to waste inordinate amounts of time scanning, loading, and indexing data sets.
— Athena's answer is point-and-shoot functionality.
— Point Athena at the data in question within your Amazon Simple Storage Service (S3) bucket.
— Define your fields and run your queries; Athena then returns the results within moments.
— Behind the scenes, Athena quietly and efficiently parallelizes your query.
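As a rough illustration of the steps above, the sketch below submits a query, waits for it to finish, and reads the first page of results. It assumes a client object with the same methods as the boto3 Athena client (`start_query_execution`, `get_query_execution`, `get_query_results`); the function name `run_query`, the database name, and the output location are hypothetical.

```python
import time


def run_query(client, sql, database, output_location, poll_seconds=1.0):
    """Submit SQL to Athena, wait for it to finish, return rows as lists of strings."""
    started = client.start_query_execution(
        QueryString=sql,
        QueryExecutionContext={"Database": database},
        # Athena writes result files to this S3 location.
        ResultConfiguration={"OutputLocation": output_location},
    )
    query_id = started["QueryExecutionId"]

    # The call above returns immediately, so poll until a terminal state.
    while True:
        execution = client.get_query_execution(QueryExecutionId=query_id)
        state = execution["QueryExecution"]["Status"]["State"]
        if state in ("SUCCEEDED", "FAILED", "CANCELLED"):
            break
        time.sleep(poll_seconds)

    if state != "SUCCEEDED":
        raise RuntimeError(f"query {query_id} ended in state {state}")

    # Only the first page is read here; large results need pagination.
    response = client.get_query_results(QueryExecutionId=query_id)
    return [
        [cell.get("VarCharValue") for cell in row["Data"]]
        for row in response["ResultSet"]["Rows"]
    ]


# Usage, assuming boto3 is installed and AWS credentials are configured:
#   import boto3
#   rows = run_query(boto3.client("athena"), "SELECT 1", "default",
#                    "s3://example-bucket/athena-results/")
```

Passing the client in as an argument keeps the function easy to exercise with a stub, which matters because real queries cost money and need credentials.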
Amazon Athena Benefits
— Serverless: the built-in query editor lets you point at your data in Amazon S3 and prepare it for analysis without the need for ETL processes.
— Open source design: ANSI SQL support, and it plays nicely with standard formats including ORC, Parquet, JSON, and CSV.
— Pay only for what you run: you only open up your wallet for the queries that you need to run.
— Faster execution: lightning-quick query performance from automatically executing all queries in parallel. You can expect results in mere seconds!
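To make the pay-per-query point concrete, here is a minimal cost sketch. Athena bills by the amount of data a query scans; the rate of 5 USD per TB and the 10 MB per-query minimum used below are assumptions that should be checked against the current AWS pricing page for your region, and the function name is hypothetical.

```python
def estimate_query_cost(bytes_scanned, usd_per_tb=5.0, min_bytes=10 * 1024**2):
    """Estimate the charge for one query from the bytes it scanned.

    usd_per_tb and min_bytes are assumed defaults, not authoritative prices.
    """
    billed_bytes = max(bytes_scanned, min_bytes)
    return billed_bytes / 1024**4 * usd_per_tb


for label, size in [("500 MB", 500 * 1024**2), ("15 GB", 15 * 1024**3), ("1 TB", 1024**4)]:
    print(f"{label}: ${estimate_query_cost(size):.4f}")
```

Because the charge follows bytes scanned, anything that reduces scanning (columnar formats such as Parquet or ORC, compression, partitioning) lowers the bill as well as the runtime.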
ETL (Extraction, Transformation, Loading)
— Extracts data from homogeneous or heterogeneous data sources
— Transforms the data into the proper format or structure for querying and analysis
— Loads it into the final target (a database or, more specifically, an operational data store, data mart, or data warehouse)
— Query execution time in Athena can vary wildly.
— Queries scanning around 15 GB of data took anywhere from 60 seconds to 2,500 seconds (~40 minutes).
— Another query scanned around 500 MB in 1,800 seconds (~30 minutes).
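The spread in those timings is easier to see as effective throughput. The short calculation below uses only the figures quoted above and assumes 1 GB = 1024 MB; the labels are illustrative.

```python
def throughput_mb_per_s(megabytes, seconds):
    """Effective scan rate: data scanned divided by wall-clock time."""
    return megabytes / seconds


runs = [
    ("15 GB, fastest run", 15 * 1024, 60),
    ("15 GB, slowest run", 15 * 1024, 2500),
    ("500 MB run", 500, 1800),
]

for label, megabytes, seconds in runs:
    rate = throughput_mb_per_s(megabytes, seconds)
    print(f"{label}: {rate:.2f} MB/s over {seconds / 60:.1f} minutes")
```

The fastest run works out to 256 MB/s, while the 500 MB run is well under 1 MB/s, so runtime here clearly depends on more than the volume of data scanned.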