Accuracy Evaluation

This crate is used to evaluate the accuracy of the Resolution Pipeline of "Cargo Ecosystem Monitor".

View Evaluation Results NOW

We store our evaluation results in the file EDG_Evaluation_20220811.zip and can be extracted and viewed directly.

After extracting result file in this directory, run cargo run --bin summary_release in your shell and you can see the summary of the results.

Re-evaluate Our Pipeline

Before executing any programs in this project, you should:

build your crates postgresql database from Crates.io first. dbname=crates user=postgres password=postgres.
Run project rust_deps to build table dep_version.
In step 2, you need to set .cargo environment to specify certain cargo index cache. In this project, you also need to do so, so that standard benchmark uses the same index.
1. Override configuration to file ~/.cargo/config.toml with

[net]
git-fetch-with-cli = true

[source.cargo_ecosystem_monitor]
registry = "file:///absolute/path/to/crates.io-index/dir" 

[source.crates-io]
replace-with = "cargo_ecosystem_monitor"

If you are using the provided docker, you can directly run make replace_cargo_mirror before running the evaluation process. And after the evaluation process, run make restore_cargo_mirror to remove the configurations. Make sure you know what is going to happen when you run it in your host machine.
Run scripts Code/scripts/prebuild.sql to build neccesary tables.
Run cargo run --bin autorun to automatically start the evaluation process, which will run three separate programs for each dataset:
1. Run cargo run --bin benchmark_dataset under this project. This will automatically generate dataset under directory output.
2. Run cargo run --bin pipeline_evaluation under this project. This will automatically generate pipeline resolution results under directory output, and also store comparison results.
3. Run cargo run --bin results_summary under this project. This will automatically analyze comparison results and print them in command line.

WARNING: As it override your local Cargo configuration, you should not do anything related to Rust and Cargo to avoid unexpected behavior when running this program or use this configuration! Reset Cargo configuration (remove our ~/.cargo/config.toml file) after execution.

When re-run the program, you have to manually clear all the cache data:

Delete directory output to avoid reconsidering duplicate crates in results summary.
Delete DB table accuracy_evaluation_status to clear all current status.
If you want to continue the stopped process, just re-run. Nothing needs to be done.
If you want to change the dataset stratety, you should change the code. Also, do as re-run.

The evaluation machanism works as follows:

Find top 2000(customized) crates with most direct dependencies.
Download crate source code from official database.
Use Cargo to resolve the dependencies of each crate in local and real environment.
Compare the resolution results with database created by our Resolution Pipeline.

Some differences can be tolerated, like new crates are published between resolution and evaluation.

Architecture

We have three binary programs, they should be executed in order:

benchmark_dataset: It downloads top crates, resolve them by using cargo tree and store results in local.
pipeline_evaluation: This process should be executed after building database dep_version using project rust_deps. It compares cargo tree dependency results with the ones resolved by our dependency resolution pipeline, which is rust_deps. The pipeline resolution and comparison results are stored in local.
results_summary: This process will summarize all comparison results and give final judgement.

Break-point Continuingly-transferring

To maintain the resolving process, we build a database table accuracy_evaluation_status. In status field, there are three possible values, which are unevaluated, resolved, evaluated, fails.

unevaluated: Not touched.
resolved: Resolved by cargo tree.
evaluated: Resolved by both cargo tree and pipeline. Work done.
fails: Crate resolution process fails, and won't be continued. Mostly caused from downloader.

Inaccuracy Types

Our pipeline resolution results may differ from standard results. The main reasons are:

Dependency entanglement: Due to Cargo dependency cache mechanism, used dependencies are influenced by unused dependencies at certain time. The cache mechanism will merge crates if different dependencies requirements can be satisfied. But that introduces uncertainly transparent to the crate developer as the dependency changed from what they think it should be.
- Example: Crate p2pands-rs-v0.4.0 depends on openmls-v0.4.1, which optionally depends on rstest-v0.13.0. At the same time, p2pands-rs-v0.4.0 has a development dependency on openmls-v0.4.1 with more features on. However, the crate openmls-v0.4.1 with more features on will be selected, as they are merged to this one. As a consequence, rstest-v0.13.0 is dependent on. This change the behavior of what developers expected, as more features are opened by default. The code behavior might changed.-

Name		Name	Last commit message	Last commit date
Latest commit History 1 Commit
src		src
.gitignore		.gitignore
Cargo.toml		Cargo.toml
EDG_Evaluation_20220811.zip		EDG_Evaluation_20220811.zip
Makefile		Makefile
README.md		README.md

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Uh oh!

Repository files navigation

Accuracy Evaluation

View Evaluation Results NOW

Re-evaluate Our Pipeline

Architecture

Break-point Continuingly-transferring

Inaccuracy Types

About

Uh oh!

Releases

Packages

Uh oh!

Languages

Rust-Hell/Rust-Local-Dependency-Resolver

Folders and files

Latest commit

History

Repository files navigation

Accuracy Evaluation

View Evaluation Results NOW

Re-evaluate Our Pipeline

Architecture

Break-point Continuingly-transferring

Inaccuracy Types

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Languages

Packages