Initialize and update submodules:
git submodule update --init --recursive
Run the following commands to set up the virtual environment, add the Python files in src to Python's import path, activate the venv, and install the dependencies:
python3 -m venv venv
echo "$(pwd)" > $(find venv/lib -maxdepth 1 -mindepth 1 -type d)/site-packages/project_root.pth
. venv/bin/activate
pip3 install -r requirements.txt
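The command that writes project_root.pth uses Python's standard .pth mechanism: any path listed in a .pth file inside site-packages is appended to sys.path when the interpreter starts, which is what lets the code under src be imported without installing the project as a package. As a quick sanity check (a minimal sketch, assuming you start Python from the repository root with the venv activated and that no symlinks complicate the path comparison), the following should print True:

```python
import sys
from pathlib import Path

# project_root.pth (written above) records the repository root; the site
# module reads the .pth file at startup and appends that path to sys.path.
print(str(Path.cwd()) in sys.path)
```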
You can download all the datasets we use in the benchmark using the download_all.py script we provide.
The download_all.py script will download all datasets into the correct directories with the specified names, concatenate multi-file datasets into a single file, and generate any modified versions of the datasets needed for tools like Presto + CLP.
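A typical invocation looks like the following (this assumes the script sits at the repository root; adjust the path if it lives elsewhere, e.g. under scripts/):

python3 download_all.py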
Follow the instructions above to set up your virtual environment.
Stay in the Log Archival Bench directory and run scripts/benchall.py. This script runs every tool + parameter combination listed in its `benchmarks` variable across all datasets under data/.
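The exact contents of the `benchmarks` variable live in scripts/benchall.py; the snippet below is only a hypothetical sketch, with placeholder field names, of how a tool + parameter entry might be organized:

```python
# Hypothetical sketch only: the real "benchmarks" variable is defined in
# scripts/benchall.py, and its actual field names and structure may differ.
benchmarks = [
    {"tool": "<tool name>", "params": ["<tool-specific parameters>"]},
]

# Conceptually, the script runs each tool/parameter combination against
# every dataset found under data/.
for bench in benchmarks:
    print(bench["tool"], bench["params"])
```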
Execute `./assets/{tool name}/main.py {path to <dataset name>.log}` to run ingestion and search on that dataset.
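For example, with hypothetical tool and dataset names (substitute the actual directory names under assets/ and the actual log files under data/):

./assets/clp/main.py data/mongod.log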
Follow the steps below to develop and contribute to the project.
Requirements:

- Task 3.40.0 or higher
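You can check which version of Task is installed with:

task --version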
Before submitting a pull request, ensure you've run the linting commands below, fixed all violations, and suppressed any benign warnings.
To run all linting checks:
task lint:check
To run all linting checks AND fix some violations:
task lint:fix
To see how to run a subset of linters for a specific file type:
task -a
Look for tasks under the `lint` namespace (identified by the `lint:` prefix).
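For example, if the listing shows a task named lint:yml-check (a hypothetical name; use whatever appears in your output), you would run:

task lint:yml-check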