Skip to content

Latest commit

 

History

History
251 lines (165 loc) · 8.3 KB

README.md

File metadata and controls

251 lines (165 loc) · 8.3 KB

jlf

Crates.io MIT licensed

cat ./examples/dummy_logs | jlf

Given some log file as below:

./examples/dummy_logs

{"timestamp": "2024-02-09T07:22:41.439284", "level": "DEBUG", "message": "User logged in successfully", "data": {"user_id": 3175, "session_id": "Nsb3P5mZ7971NFIt", "ip_address": "149.215.200.169", "action": "login", "success": false, "error_code": null}}
{"timestamp": "2024-02-09T07:22:42.439284", "level": "ERROR", "message": "Database connection established", "data": {"user_id": 8466, "session_id": "ZMOXKPna3GbzWz2N", "ip_address": "213.135.167.95", "action": "logout", "success": true, "error_code": null}}
...

It will output a more colorful and readable log like below:

Screenshot 2024-02-09 at 12 23 12 PM

Installation

Cargo

cargo install jlf

Manual Installation

You can also clone the repo and install it manually.

git clone https://github.com/PoOnesNerfect/jlf.git
cd jlf
cargo install --path . --locked

CLI Options

$ jlf -h

JSON Log Formatter CLI

Usage: jlf [OPTIONS] [FORMAT_STRING]

Arguments:
  [FORMAT_STRING]  [default: "{#log}{#if spans|data}\\n{spans|data:json}{/if}"]

Options:
  -n, --no-color     Disable color output. If output is not a terminal, this is always true
  -c, --compact      Display log with data in a compact format
  -s, --strict       If log line is not valid JSON, then report it and exit, instead of printing the line as is
  -t, --take <TAKE>  Take only the first N lines
  -h, --help         Print help
  -V, --version      Print version

Custom Formatting

You can optionally provide your custom format of the output line.

cat ./examples/dummy_logs | jlf '{#log}{#if spans|data}\n{spans|data}{/if}'

Supplied format above is the default format, so it will output the same as the default format.

{#log} is a function log, which is a convenience function that prints the basic log format. Currently, function feature is very early and only log function is available. log function is equivalent to the format {timestamp:dimmed} {level|lvl:level} {message|msg|body}.

{#if spans|data}...{/if} is a function if, which is a conditional function that prints the content inside the block if the field spans or data exists. More about the function's behavior in {#if}

\n will print a newline.

{spans|data:json} will print the spans or data field as json.

{timestamp:dimmed} means that the cli will look for timestamp in the json and print it with dimmed dimmed.

level|lvl means that the cli will look for level and lvl in the json and use the first one it finds. The style is also called level, which is a special style that will color the level based on the level (debug = green, info = cyan).

{message|msg|body} means that the cli will look for message, msg, and body in the json and use the first one it finds.

Printing the entire JSON

If you want to print the entire JSON line, you can just use {}.

cat ./examples/dummy_logs | jlf '{}'

You can still provide styles to it.

cat ./examples/dummy_logs | jlf '{:compact,fg=green}'

Styling

You can provide styles to the values by providing styles after the :.

cat ./examples/dummy_logs | jlf '{timestamp:bright blue,bg=red,bold} {level|lvl:level} {message|msg|body:fg=bright white}'

If you have multiple styles, you can separate them with ,, like fg=red,bg=blue.

You can optionally provide the style type before the =. If you don't provide it, it will default to fg.

Available attributes

  • dimmed: make the text dimmed
  • bold: make the text bold
  • fg={color}: set the text color
  • {color}: same as fg={color}
  • bg={color}: set the background color
  • indent={n}: indent the value by n spaces
  • key={color}: sets the color of the key
  • value={color}: sets the color of the value
  • str={color}: sets the color of the string data type
  • syntax={color}: sets the color of the syntax characters
  • json: print the json value as json; this is the default and only available format, so you don't have to specify it
  • compact: print in a single line
  • level: color the level based on the level (debug = green, info = cyan)

{color} is a placeholder for any color value.

Available Colors

You can view all available colors in colors.md.

Functions

Functions in jlf start with # inside {}.

log

log is a convenience function that prints the basic log format.

cat ./examples/dummy_logs | jlf '{#log}'

equals

cat ./examples/dummy_logs | jlf '{timestamp:dimmed} {level|lvl:level} {message|msg|body}'

and will print

2024-02-09T07:22:41.439284 DEBUG User logged in successfully

if

You can use {#if field}...{/if} to conditionally print the content inside the block if the field exists.

the condition only checks if the field exists (or is null) or not, but not the truthiness of the field.

If the field exists and the value is false, it will still print the content inside the block.

cat ./examples/dummy_logs | jlf '{#if spans|data}data: {spans|data:json}{/if}'

will print data: { ... } only if spans or data field exists.

Neat Trick

  • If the line is not a JSON, it will just print the line as is.
  • It removes all ANSI escape codes when piping to a file.

This means, you can just use jlf for non-JSON logs to pipe logs to a file without all the ansi escape codes. When you just pipe it to a terminal, it will still style the logs as before.

Neat, right?

Implementation

JSON Parsing

The program cannot assume what the data structure of the incoming JSON logs will be. There is no guarantee that the application that is piping the logs uses the best practices for logging, or keep the consistent structure.

Thus, it must be able to parse any JSON log dynamically; that leaves us with having to use serde_json::Value.

But, can we do better? The answer is yes.

Although we cannot assume the data structure of the logs, we can still optimize for the common characteristics of JSON logs. So, I decided to make a custom JSON parser that is optimized for JSON logs.

Some characteristics of common json logs:

Below are some characteristics of common json logs that I thought I could optimize for:

  1. each log line is usually not super huge:
  2. log lines usually have similar structures:
  3. we don't need to transform data; we just reformat them.

Optimizations

Below are the optimizations I implemented for the corresponding items above:

  1. JSON objects are parsed into vec of key-value pairs instead of map.
    • this way, we don't have to allocate memory for each key and value.
  2. Since each line of JSON log has a similar structure, we can reuse the existing vecs that are already allocated.
    • we don't have to allocate memory for each line.
  3. Don't validate primitive values, since we don't need to transform the data.
  4. Instead of allocating new Strings for each key and value, we use &str slices of the log string.

Benchmarks

So, how did it perform? That's the only thing that matters.

custom parse time: [987.52 ns 993.59 ns 1.0006 µs]
Found 12 outliers among 100 measurements (12.00%)
9 (9.00%) high mild
3 (3.00%) high severe

serde value parse time: [2.8045 µs 2.8357 µs 2.8729 µs]
Found 8 outliers among 100 measurements (8.00%)
4 (4.00%) high mild
4 (4.00%) high severe

serde structured parse time: [712.16 ns 714.93 ns 717.54 ns]

First section is the custom parse, second is the parsing into serde_json::Value parse and third is deserializing into a structured rust object.

The time is how long it took to deserialize a single line of json log.

As we can see, our custom parser is about 3x faster than the serde_json::Value parsing. Yes, it is still slower than the structured parsing, but our parser is still pretty darn fast for parsing a dynamic JSON data.