Add dynamic replication to kernel NR #338
Conversation
Looks good! I left a few comments - most of them minor, and the few that aren't may be due to my own misunderstanding.
use crate::nrproc::NrProcess;
use crate::process::MAX_PROCESSES;

/// Types of shared structures the client can request
#[derive(Debug, Eq, PartialEq, PartialOrd, Clone, Copy)]
#[repr(u8)]
pub enum ShmemStructure {
    // TODO(dynrep): remove NrProcLogs/NrLog add NodeReplicated<Process> and
I think we should remove this comment?
@@ -67,7 +67,6 @@ pub(crate) fn schedule() -> ! {
     // There is no process but we're the "main" thread,
     // aggressively try and advance the replica
     let start = rawtime::Instant::now();
-    crate::nrproc::advance_all();
     crate::arch::advance_fs_replica();
Right now cnrfs isn't using dynamic replication, yes? Is that why advance_fs_replica() remains?
If so, I think that's just fine for now - no urgent need to port to cnrfs for the benchmarks we're looking at.
I'm asking because as long as the advance_fs_replica() logic remains, the TLB work queues stay large/complicated (#290). So it might be nice to document that implementing this for cnrfs would fix that issue :)
@@ -31,6 +31,7 @@ fn s04_userspace_multicore() {
         .user_feature("test-scheduler-smp")
         .build();
     let cmdline = RunnerArgs::new_with_build("userspace-smp", &build)
+        .nodes(num_cores / 16)
Does this no longer work with full cores? It's unclear to me how 16 plays into this test.
     } else {
-        cmdline = cmdline.nodes(machine.max_numa_nodes());
+        cmdline = cmdline.nodes(std::cmp::max(machine.max_cores() / 16, machine.max_numa_nodes()));
Are we assuming a maximum of 16 cores per NUMA node? skylake4x has 24, so this may not be a solid assumption?
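If the 16 is intentional (one replica per ~16 cores), maybe name the constant so the assumption is visible at the call site. A minimal sketch, same logic as the diff with nothing new beyond the named constant:

    // Make the "cores per replica" assumption explicit; the clamp keeps us
    // from ever configuring fewer nodes than the machine has NUMA domains.
    const CORES_PER_REPLICA: usize = 16;
    cmdline = cmdline.nodes(std::cmp::max(
        machine.max_cores() / CORES_PER_REPLICA,
        machine.max_numa_nodes(),
    ));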
@@ -834,7 +834,7 @@ fn s10_leveldb_benchmark() {
 }

 #[test]
-fn s10_memcached_benchmark_internal() {
+fn s10_xmemcached_benchmark_internal() {
typo maybe?
@@ -35,7 +35,7 @@ fn maponly_bencher(cores: usize) {

     // see process.rs the heap split up by core from slots 1..128, so we start from there
Comment is out of date
@@ -512,15 +531,12 @@ fn _start(argc: isize, _argv: *const *const u8) -> isize {
     crate::nrproc::register_thread_with_process_replicas();
Previously, the controller initialized the log but didn't have a replica of its own, so the total number of process replicas was equal to the number of clients. Reading the code, I believe this is still true, but I wanted to confirm.
@@ -20,6 +20,7 @@ pub(crate) struct ShmemAlloc {
 }

 impl ShmemAlloc {
+    #[allow(dead_code)]
Do the logs use allocators at all? Can this be deleted entirely?
    }
    process_logs
};
// We want to allocate the logs in controller shared memory
I think the comment is outdated. We are allocating below from local memory (local_shmem_affinity()).
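Something like this would match what the code below actually does, assuming the allocation really goes through local_shmem_affinity():

    // Allocate the logs in this node's local shared memory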
        pcm.set_mem_affinity(mid_to_shmem_affinity(r)).expect("Can't change affinity");
    }
    AffinityChange::Revert(_orig) => {
        pcm.set_mem_affinity(local_shmem_affinity()).expect("Can't set affinity")
local_shmem_affinity() returns a value relative to where it's run. Also, the memory allocator may have been non-shared originally, so we don't want to automatically revert to shmem regardless. I think we need to capture the actual previous value to do this correctly.
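Roughly what I have in mind, as a sketch only: the Revert variant already carries a payload (the _orig above, currently ignored), so if Replace captured the allocator's affinity before switching, Revert could restore exactly that value. current_mem_affinity() is hypothetical here; whatever accessor exposes the allocator's present affinity would go in its place:

    AffinityChange::Replace(r) => {
        // Capture the true previous affinity before switching; it may be
        // non-shared memory, so it can't be assumed to be local shmem.
        let orig = pcm.current_mem_affinity(); // hypothetical accessor
        pcm.set_mem_affinity(mid_to_shmem_affinity(r)).expect("Can't change affinity");
        orig // stash this so the later Revert(orig) sees the real previous value
    }
    AffinityChange::Revert(orig) => {
        // Restore exactly what was there before, not local_shmem_affinity().
        pcm.set_mem_affinity(orig).expect("Can't restore affinity")
    }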
Signed-off-by: Gerd Zellweger <mail@gerdzellweger.com>
Force-pushed from fe70c07 to ad53903.