Skip to content
fiwang edited this page Nov 2, 2015 · 7 revisions

Overview

Welcome to the Pulsar Reporting WiKi page!

Pulsar Reporting Framework is an open-source, extensible data visualization and reporting framework designed to provide real-time insights for Pulsar Pipeline.It provides a rich set of charting widgets and a visual reporting editor for user to easily create reports. It has a robust data query engine that can be extended to support many different types of data sources. With Pulsar Reporting Framework, user can quickly create multi-dimensional and interactive reports with drill-down and slice-and-dice capabilities.

Features

  • Near real-time reports - Building reports based on near real-time data that auto-refresh at certain interval
  • Visual reporting editor - Generating reports without writing any code
  • Rich charting widgets - Line, bar, histogram, pie, stack, datatable, etc.
  • Reporting API supporting both SQL and structured JSON queries - Querying data with human-friendly SQL or programming-friendly structured JSON
  • Dynamic data source management - No downtime for adding or removing data sources
  • Security and permission - Authentication and access control management
  • Druid Kafka extension - Ingesting real-time data from Kafka into Druid
  • AngularJS based Hierarchical UI framework - Easily adding and extending reports
  • Bootstrap based responsive design - Looking great on different sizes of screens

Why Pulsar Reporting Framework

Pulsar Reporting Framework complements Pulsar, an open-source, real-time analytics platform and stream processing framework. Pulsar generates huge amount of data and visualization is the best way to provide intuitive and meaningful insights into those data. However, building dashboards and reports for big data from scratch is cumbersome and error-prone. Pulsar Reporting Framework allows user to create reports easily and quickly without requiring complex data processing and UI logic.

Architecture

The raw events and session events from Pulsar Pipeline are flowing to Kafka using Pulsar Kafka channel. Then the Druid cluster ingests the raw events and sessions from Kafka topics into two tables, one for sessions and one for events. Both tables are indexed in one second granularity to enable realtime reporting capability. Pulsar Reporting API provides an abstract layer to access the tables, and the Reporting UI gets the data from the API to build different charts.

Documentation

  • [Getting Started](Getting Started) to get started and acquainted with Pulsar Reporting.
  • Pulsar Reporting UI User Guide to get a deep understanding on the UI framework usage.
  • [Pulsar Reporting API User Guide](Pulsar Reporting API User Guide) to get a deep understanding on the Pulsar Reporting API usage.

Key Contributors

  • Tony Ng
  • Nemo Chen
  • Zhiyi Liu
  • Ken Wang
  • Rincon Miguel
  • Qinhua Xing
  • Xin Xu
  • Julian Pan
  • Xiaoming Zhang
  • Rick Tao

Acknowledgments

  • Sharad Murthy
  • Xinglang Wang
  • Lisa Li
  • Grace Li
  • Fiona Wang
  • Dror Engel
  • Edward Jan
  • Jiahong Han
Clone this wiki locally