Hive Metastore

The foundational metadata service for Hadoop ecosystem, still widely used for table and schema management.

CLI

About Hive Metastore

The foundational metadata service for Hadoop ecosystem, still widely used for table and schema management. Explore how Hive Metastore integrates with the agentic data stack ecosystem and supports autonomous data operations.

Key Features

  • Central metadata repository via Thrift interface for tables, partitions, databases, and functions
  • Industry-standard metastore protocol supported by Spark, Trino, Presto, Impala
  • Standalone metastore mode for running independently of the full Hive execution engine
  • RDBMS-backed persistence via DataNucleus ORM (Derby, MySQL, PostgreSQL, Oracle)
  • High availability through stateless architecture with multiple metastore instances
  • Schema management tooling (schematool) for initialization and upgrades
  • Bulk metadata operations (metatool) for NameNode migration and JDOQL queries
  • Multi-catalog support (Hive 3.0+) for logical metadata separation

Agent Integration

CLIschematool / beeline

$ Download Apache Hive binary tarball, set HIVE_HOME, run schematool -initSchema
CLI Documentation