Created metrication for inter-proclet communication by Bokai-Bi · Pull Request #1 · Nu-NSDI23/Nu

Bokai-Bi · 2023-06-29T18:28:53Z

Added instrumentation for inter-proclet communication. For local communication, each proclet logs the total amount of function calls. For remote communication, each proclet logs the total amount of calls and the total size of data transferred on a per-target-machine basis. Data about local communication is stored in a Counter in the caller's header while data about remote communication is stored in a std::unordered_map also in the caller's header synchronized by its spin_lock. No data is logged on the callee's side.
Added a new benchmark bench/bench_proclet_logging, which benchmarks the performance of remote proclet communication. Only works when the main server and remote server are started under specific IPs specified in code.

inc/nu/impl/proclet.ipp

zainryan · 2023-07-01T14:44:14Z

inc/nu/impl/proclet.ipp

+
+      caller_header->spin_lock.lock();
+
+      auto target_kvpair = caller_header->remote_call_map.find(target_ip);


why the key is the "target_ip" not simply the "callee_id"?

For locality improvements we are interested in the total aggregate communication from a proclet to all other machines, using the ip of the callee allows us to collect data on a per-target-machine basis. Using callee_id as key will not be optimal since proclets can migrate

I actually feel that using the callee_id might be better as it tells us whether should be colocate certain proclet pairs to improve locality. Using the per-machine metric hides all these details.

zainryan · 2023-07-01T14:47:19Z

inc/nu/impl/proclet.ipp

+      ProcletSlabGuard slab_guard(&caller_header->slab);
+      NodeIP target_ip = get_runtime()->rpc_client_mgr()->get_ip_by_proclet_id(id);
+
+      caller_header->spin_lock.lock();


This will limit the invocation tput of the caller proclet to roughly 1MOPS

Why is that the case? (out of interest)

Is this because of the call to get_ip_by_proclet_id or the locking? The locking can theoretically be removed by replacing the map with an array that can be unsafely modified.

I mean the locking. Yeah making it lockless would be better.

zainryan

Thanks Bokai, it looks functionally correct! I have some comments wrt to the metric and synchronization, please look at the embedded comments.

…e calls

…d proclet logging benchmark

Bokai-Bi and others added 23 commits June 13, 2023 22:50

Slightly modified a comment

d265a0d

included unordered_map in proclet_mgr.cpp to print out header contents

3e0e834

bug fix

cee513a

Compiler error fixes

b626fee

Unit disabling

d4c2132

Update proclet.ipp

d47708a

Unit testing

88acf58

new debugging checkpoints

4360667

initialized map

aee42a5

Bug fixes

6f4652f

no-error solution that doesn't handle migration

c010341

Finished impl v1, logging benchmark WIP

7b6bf77

Finished benchmark test

0ede345

prepare repository for multi-machine test

ffd35a8

changed benchmark to non-distributed

48f1a26

pinned proclets in benchmark

a5bb778

Prepare for pull request, comments and code cleanup

a4e9680

Update test.sh to original settings

1d39a24

Update proclet_mgr.cpp

451c080

Cleaned proclet.ipp for pull request

0ba7d56

Update proclet_mgr.hpp

dea0f56

Reverted comments to test_condvar.cpp

820d081

Deleted start_controller.sh for pull request

1a1b9a7

zainryan reviewed Jul 1, 2023

View reviewed changes

inc/nu/impl/proclet.ipp Show resolved Hide resolved

zainryan reviewed Jul 1, 2023

View reviewed changes

Bokai-Bi added 3 commits July 6, 2023 11:53

Update proclet_mgr.hpp to change NodeIP to ProcletID in logging remot…

1028464

…e calls

Update proclet.ipp to use ProcletID instead of NodeIP as key

58632e6

Update proclet_mgr.cpp to reflect changing NodeIP to Proclet ID

dc66aa2

bokaibi and others added 5 commits July 10, 2023 12:52

Updated benchmark for multithreaded proclet logging performance

9bc2bd0

reduced thread number

72f27e0

Added instrumentation and test for compute intensity; slightly tweake…

949d147

…d proclet logging benchmark

Updated compute intensity benchmark

879c75d

Update bench_proclet_logging.cpp for maximum lock contention

61a41ff

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Created metrication for inter-proclet communication#1

Created metrication for inter-proclet communication#1
Bokai-Bi wants to merge 31 commits intoNu-NSDI23:mainfrom
Bokai-Bi:pull-to-original

Bokai-Bi commented Jun 29, 2023

Uh oh!

Uh oh!

zainryan Jul 1, 2023

Uh oh!

Bokai-Bi Jul 4, 2023

Uh oh!

zainryan Jul 5, 2023

Uh oh!

zainryan Jul 1, 2023

Uh oh!

ms705 Jul 3, 2023

Uh oh!

Bokai-Bi Jul 4, 2023

Uh oh!

zainryan Jul 5, 2023

Uh oh!

zainryan left a comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants


		caller_header->spin_lock.lock();

		auto target_kvpair = caller_header->remote_call_map.find(target_ip);

Conversation

Bokai-Bi commented Jun 29, 2023

Uh oh!

Uh oh!

zainryan Jul 1, 2023

Choose a reason for hiding this comment

Uh oh!

Bokai-Bi Jul 4, 2023

Choose a reason for hiding this comment

Uh oh!

zainryan Jul 5, 2023

Choose a reason for hiding this comment

Uh oh!

zainryan Jul 1, 2023

Choose a reason for hiding this comment

Uh oh!

ms705 Jul 3, 2023

Choose a reason for hiding this comment

Uh oh!

Bokai-Bi Jul 4, 2023

Choose a reason for hiding this comment

Uh oh!

zainryan Jul 5, 2023

Choose a reason for hiding this comment

Uh oh!

zainryan left a comment

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants