Multithreading via C++ thread pool of clients #125

kentslaney · 2023-12-14T00:17:49Z

I've implemented the C++ thread workers to be FILO under the assumption that reusing a single client is preferable when possible. The current implementation needs C++17 or above to parallelize some of the client setup (std::execution). Building still fails with the required version, but I figured I'd open a draft PR anyway in case there's feedback. I'd especially appreciate advice about a good testing setup for the concurrent code along with any opinions about language bindings.

lexdene · 2023-12-14T02:56:58Z

World you please show a benchmark about the performance improvement your modifications have?
If there is no obvious improvement, I'd like to do the multithreading stuffs in Python code rather than inside the code of libmc.

kentslaney · 2024-02-07T04:58:55Z

Alright, I think this is ready to be reviewed/merged. I also rewrote some parts of the README unrelated to this PR for the sake of clarity.

I'll start tracking gevent support for threaded clients in a greenify PR. Ideally that shouldn't affect the implementation here.

Edit: actually, one more thing; I meant to look into golang bindings

kentslaney · 2024-02-08T16:01:37Z

It looks like gomemcache is already thread safe, so I'm back to thinking this PR is done.

kentslaney · 2024-05-31T21:28:01Z

I've closed the greenify PR for being unsolvable. Unless you have feedback, this pull request is done. Please squash & merge.

Thanks for your time, I know this is a lot of code to add at one time. The reason that I made the fork is already satisfied by having it under my profile, so the pull request really is up to you.

lexdene · 2024-06-03T09:13:13Z

I think the changes under .github/workflows directory are not related.
Would you please revert all changes under this directory?

I do not agree with some changes of readme.
Additionally, I do not want to discuss much about readme here which I think is not related.
Would you please revert all changes in readme and submit another pull request which only contains changes of it so that we can discuss specifically and pointedly.

What is misc/aliases used for?
I have not found any code referencing this file.
If it is possible to delete misc/aliases,
misc/git/debug.patch and misc/git/pre-commit which are only referenced by misc/aliases can also be deleted.

Changes about greenify are also not related I think.

Sorry for replying so slowly, but it is really not easy to look over such a big pull request.
I will be happier if it is splitted into several small ones.

kentslaney · 2024-06-04T19:20:29Z

I think I've now removed all the parts of the pull request that aren't necessary for the ThreadedClient interface in Python.

Quick summary of the parts removed:

.github/workflows/golang.yml and .github/workflows/python.yml: changes for nektos/act
.github/workflows/manual.yml: shorter, manual tests for faster iteration
.gitignore: cygdb (cython debugger) uses /cython_debug
Makefile: make clean
README.rst: discussed above
include/Common.h, src/Common.cpp, and src/Connection.cpp: better naming for some of the things in Add support for UNIX domain sockets #120
misc/aliases: executable with shortcuts for commands I regularly used during development
misc/git/*: created git hooks that automatically un-staged the setup.py changes I needed for debugging Cython
misc/memcached_server: logs all port traffic for debugging reasons
misc/runbench.py: raises an exception for tests with incorrect output instead of potentially clogging stdout
setup.py: actually I don't remember why this one was there, but it removes some process cleanup steps apparently

I'm currently expecting to make separate PRs to add back the changes in:

README.rst
include/Common.h, src/Common.cpp, and src/Connection.cpp

Here's some ideas on parts that can further split up the changes, along with what I see as the benefits and reasons not to.

benchmarking changes needed for threading
- pros
  - allows the functional parts to be merged with less code to review
- cons
  - mostly just moves the changes in runbench.py to a different PR
  - removes an integration test for the threaded code
separate the C threading interface from the python one
- pros
  - fairly even split, significant reduction in largest PR size
- cons
  - they're already mostly organized by file type
  - wouldn't clarify the changes in either
  - the python interface, most likely to be user facing, has the same critical path

Please let me know if there's any changes that would make reviewing easier.

kentslaney · 2024-06-05T06:01:52Z

src/ClientPool.cpp

+  std::atomic<int> rv = 0;
+  std::lock_guard<std::mutex> updating(m_fifo_access);
+  std::for_each(irange(0), irange(m_clients.size()),
+  //std::for_each(std::execution::par_unseq, irange(0), irange(m_clients.size()),


The loops for initializing and growing the client pool (doubles in size each time it's overrun) were written using for_each in order to support parallelized initialization of the client objects, but, at the scales I tested it at, the parallelization didn't make a difference, thus the commented out execution policy.

kentslaney · 2024-06-05T06:05:38Z

misc/runbench.py

+        self.config(libmc.MC_MAX_CLIENTS, POOL_SIZE)
+
+
+class FIFOThreadPool(ThreadPool):


This class is analogous to the C++ LockPool class and might be an easier way to understand the overall control flow

kentslaney · 2024-06-06T04:09:25Z

🎉 🎉 🎉

kentslaney added 7 commits December 11, 2023 00:28

switching machines

a6e238b

switch back...

22b1e47

LockPool

9fe14b3

ClientPool

02c941d

missed renamed symbol

3fb5dd7

cppcheck fixes

b7305ca

minor build fixes

c936fdf

kentslaney added 22 commits December 14, 2023 11:42

std::execution::par[_unseq] requires -fexceptions

a4662c2

act support

f5eca1c

python bindings

448458b

avoiding iota

b5410a3

switching machines

2324b0f

still broken...

1e9f944

...with a different error though

6c860c3

stranger error

9ce5327

well that feels silly after the fact

07964a8

redundancies

550ef71

partially fix cython integration

f099e68

reset cmake

cfdd1b4

python class

e4af40d

remove header...

f538b6e

__cinit__ semantics

f0b37f1

split up tests

4d7ecdb

flustered and cygdb can't find debugging logs

87b834f

mild QOL patch

df5f3d3

trailing whitespace

5eff5b3

forgot to init ClientPool

6285a81

pre-commit still fails on macos

167d123

made applying debug patch easier

9416571

kentslaney added 9 commits February 6, 2024 14:58

update cgo CXXFLAGS

b1263d1

fix double acquire attempt for m_pool_lock

36f1644

explicit irange constructor

ccd20a5

README update

7e7dfae

link format

1c8c6bc

typo

399e4e6

md -> rst

728ddca

remove dead link

d561b84

clarifying statistics

1f8dd88

kentslaney marked this pull request as ready for review February 7, 2024 04:16

kentslaney added 2 commits February 6, 2024 22:03

better footnote about traffic stats

87c4e64

wording

78a5431

kentslaney added 3 commits May 31, 2024 13:46

gevent support explanation

f8ecc8e

version bump

51e627f

remove gevent test for ThreadedClient

1327704

kentslaney added 2 commits June 4, 2024 10:21

Separation of concerns

f0263ad

mistake in benchmark thread spawning

081db1d

kentslaney commented Jun 5, 2024

View reviewed changes

lexdene approved these changes Jun 5, 2024

View reviewed changes

lexdene merged commit fb03c5d into douban:master Jun 5, 2024
28 checks passed

kentslaney mentioned this pull request Jun 6, 2024

Better naming for UNIX socket function #126

Merged

kentslaney added a commit to kentslaney/libmc that referenced this pull request Jun 6, 2024

readme update from douban#125

362921b

lexdene pushed a commit that referenced this pull request Jul 29, 2024

README update from #125 (#127)

9987819

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multithreading via C++ thread pool of clients #125

Multithreading via C++ thread pool of clients #125

kentslaney commented Dec 14, 2023

lexdene commented Dec 14, 2023

kentslaney commented Feb 7, 2024 •

edited

Loading

kentslaney commented Feb 8, 2024 •

edited

Loading

kentslaney commented May 31, 2024 •

edited

Loading

lexdene commented Jun 3, 2024

kentslaney commented Jun 4, 2024

kentslaney Jun 5, 2024 •

edited

Loading

kentslaney Jun 5, 2024

kentslaney commented Jun 6, 2024

		self.config(libmc.MC_MAX_CLIENTS, POOL_SIZE)


		class FIFOThreadPool(ThreadPool):

Multithreading via C++ thread pool of clients #125

Multithreading via C++ thread pool of clients #125

Conversation

kentslaney commented Dec 14, 2023

lexdene commented Dec 14, 2023

kentslaney commented Feb 7, 2024 • edited Loading

kentslaney commented Feb 8, 2024 • edited Loading

kentslaney commented May 31, 2024 • edited Loading

lexdene commented Jun 3, 2024

kentslaney commented Jun 4, 2024

kentslaney Jun 5, 2024 • edited Loading

Choose a reason for hiding this comment

kentslaney Jun 5, 2024

Choose a reason for hiding this comment

kentslaney commented Jun 6, 2024

kentslaney commented Feb 7, 2024 •

edited

Loading

kentslaney commented Feb 8, 2024 •

edited

Loading

kentslaney commented May 31, 2024 •

edited

Loading

kentslaney Jun 5, 2024 •

edited

Loading