JCollectd

This project aims to be a Java (relatively) lightweight alternative to more complete but more complex monitoring system like Nagios. It has simplicity on his mind: no external runtime dependency except Java 21, just one jar file, one configuration file and one (single file) sqlite database.

The program does the following:

reads the configuration file
collects system information reading /proc virtual filesystem on Linux, or via sysctl on FreeBSD
writes collected samples into a sqlite database
provides a simple embedded Angular web application, featuring nice reports made with Apache ECharts
rinse and repeat until stopped

Build instructions

To build the application, you will need the following software to be installed on your system:

Move into the web directory, and build the web component:

$ npm install
$ npm run build

Copy the newly produced content of web/dist/jcollectd/browser into the static resources of the Java application, located at src/main/resources/web.

Move back into the root directory and build the jar:

$ mvn clean package

An executable file jcollectd.jar will be produced into target directory. Copy it whenever you want, it's portable.

More conveniently, you can use the provided Dockerfile to build the application, without any dependency except Docker itself, with the following command:

$ docker build --target=out --output=out .

Of course, you will still need Java 21 to run the application.

Configuration

Configuration is done via single YAML file. Supported parameters are:

Parameter	Mandatory	Default value	Description
`verbose`	no	false	Boolean value to set log verbosity. Normally the application will print its configuration when starting, and fatal errors when they occur, in verbose mode will print all debug lines.
`hostname`	no	autodetect	The machine name, used to customize html report. If not provided, the program tries to autodetect it; if it fails, it falls back to `localhost`
`interval`	no	PT1M	Interval between samplings, in case you want to customize the granularity, but one minute is a safe and sane default. Expressed as Duration
`retention`	no	PT12H	Time window of data to keep in the database and to draw charts. Expressed as Duration
`port`	no	8080	HTTP port to bind the webserver to
`probes`	yes	-	List of defined probes. You must define at least one probe or the program will refuse to start

Each probe can be configured by the following parameters:

Parameter	Mandatory	Default value	Description
`type`	yes	-	The probe type, can have one of the following values: `load`, `cpu`, `mem`, `net`, `disk`, `zfs`, `gpu`, see below for details
`size`	no	`full`	The chart size, can be full or half page width
`device`	yes	-	Probes `net`, `disk` and `zfs` require the device you want to monitor, respectively the name of the network interface, or the block device, or the ZFS dataset. This parameter is ignored for other probe types
`label`	no	the value of `device` parameter	Used when you want to customize the device name shown in the chart, with a more meaningful value (e.g. `LAN` and `WAN` instead of `eth0` and `eth1`)

Probe type	Description
`load`	enables average load sampling
`cpu`	enables CPU percent utilization sampling
`mem`	enables memory, swap and cache sampling
`net`	enables network traffic sampling
`disk`	enables block device usage sampling
`zfs`	enables ZFS dataset usage sampling (currently on FreeBSD only)
`gpu`	enables GPU usage sampling (currently with Nvidia cards only, and `nvidia-smi` is required to be installed)

Additional information

disk probe supports the aggregation of devices, with a + separated list of devices to be aggregated, in case of software RAID arrays or ZFS pools.
Linux provides I/O totals for mdadm raid arrays, so you have the choice to probe the logical amount of disk activity (using the array itself as device, e.g. device: md0), or the aggregation of single disks composing the array (e.g. device: sda+sdb+sdc in case you have a 3-disk RAID5).
FreeBSD, on the other hand, provides totally different mechanisms to retrieve the two types of readings, so you can opt for a specific zfs probe to get the first, or a standard aggregated disk probe for the latter. Notice that ARC is involved in the calculation returned by the kernel, so if you read a 1GB that is totally in cache, zfs probe will report the reading, disk will not.

Sample configuration file

retention: PT24H
probes:
  - type: load
    size: full
  - type: cpu
    size: half
  - type: mem
    size: half
  - type: net
    size: half
    device: ix0
  - type: disk
    size: half
    device: ada0+ada1+ada2+ada3+ada4+ada5
    label: zstore

Usage

Simply execute the jar in background, with a command like this (assuming Java binary is in PATH)

nohup java -Xms32m -Xmx32m -jar jcollectd.jar config.yaml > jcollectd.log &

Run as service

In the artifacts/service folder you will find examples to run JCollectd as OS service, in rc and systemd there are simple rc script and systemd unit files for integration with FreeBSD and Linux init system, respectively.

Notes

It is worth saying that:

The memory footprint is relatively low, a few MB of heap size is enough to run the daemon with a reasonable configuration, but you may want to raise maximum heap size (depending on your dataset size).
Even with a very low heap, some memory will be consumed by internal mechanisms of sqlite memory allocation, this will be native memory and cannot be tuned via Java parameters.
Logging facility is provided by log4j2. The program, at default verbosity, logs only on the console a brief recap of what has been parsed from configuration file during startup, and any unrecoverable error that will prevent a correct monitoring, causing the program to exit. So there is no need to rotate log file (which is actually impossible with logrotate because Java ignores HUP signals), a single log file will be enough to discover if something is going wrong, and why.

Name		Name	Last commit message	Last commit date
Latest commit History 277 Commits
artifacts/service		artifacts/service
src/main		src/main
web		web
.dockerignore		.dockerignore
.gitignore		.gitignore
Dockerfile		Dockerfile
LICENSE		LICENSE
README.md		README.md
lombok.config		lombok.config
pom.xml		pom.xml

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Uh oh!

Uh oh!

Repository files navigation

JCollectd

Build instructions

Configuration

Additional information

Sample configuration file

Usage

Run as service

Notes

Contributions, critics, suggestions are always welcome. Cheers!

About

Uh oh!

Releases

Packages

Uh oh!

Contributors 3

Uh oh!

Languages

Uh oh!

License

Uh oh!

GilGalaad/JCollectd

Folders and files

Latest commit

History

Repository files navigation

JCollectd

Build instructions

Configuration

Additional information

Sample configuration file

Usage

Run as service

Notes

Contributions, critics, suggestions are always welcome. Cheers!

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors 3

Uh oh!

Languages

Packages