# Data Ingestion Layer

The data ingestion layer is responsible for capturing and pre-processing blockchain data across multiple networks.

### Data Sources

* Accessing 40+ blockchains including Ethereum, Solana, Polygon, and others
* Aggregating data from DEXs, contract registries, and token metadata providers
* Real-time mempool and state diff indexing

### Processing Pipeline

* Streaming ingestion and real-time processing via Apache Kafka and Flink
* Deduplication and schema alignment with error correction
* Transaction and wallet tagging

### Output

* Raw blockchain data is transformed into normalized, query-ready formats (Avro, Parquet)
* Transaction objects enriched with wallet tags for deep analytics
