Skip to content

Conversation

@Dimitrionian
Copy link
Collaborator

Add Apache HBase Provider

This PR introduces a new Apache HBase Provider for Apache Airflow, enabling seamless integration with Apache HBase - a distributed, scalable, big data store built on Apache Hadoop.

Features

  • Table Operations: Create, delete, and manage HBase tables with column families
  • Data Operations: Insert, retrieve, scan, and batch operations on table data
  • Backup & Restore: Full and incremental backup operations with restore capabilities
  • Monitoring: Sensors for table existence, row counts, and column values
  • SSL/TLS Support: Secure connections with certificate validation
  • Kerberos Authentication: Enterprise authentication with keytab support
  • Connection Pooling: High-performance connection pooling with optimized batch processing
  • Performance Optimization: Configurable batch sizes and parallel execution

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants