STM hash library for C (glib?)

Asked 8/11, 2009 at 19:24 Answered 20/2, 2012 at 9:7

I'm looking for some C library that includes STM-style (Software Transactional Memory) hash maps, but I had no luck so far. It would be great if it was based on glib / gobject, but it's not that crucial. It also doesn't need proper transactions over many objects - single immutable hash support is all I really need.

Must haves: immutable snapshot read, lock-free write with auto-retry.

Almemar answered 8/11, 2009 at 19:24 Comment(3)

Did you mean STL instead of STM? – Margalo 8/11, 2009 at 19:32

I think he means STM - Software Transactional Memory - since he's looking for things like snapshots and auto-retry: en.wikipedia.org/wiki/Software_transactional_memory But he should probably make that clear, since STM isn't a very well known area. – Interphase 8/11, 2009 at 19:47

Corrected the description. Michael is right. – Almemar 8/11, 2009 at 19:51

Wikipedia has a list of various STM implementations.

Sunglasses answered 8/11, 2009 at 20:3 Comment(1)

I looked through those. But they're very low-level. The best one for integration is probably stmmap: github.com/skaphan/stmmap. But I'm looking for something that already includes the hashmap implementation. Unfortunately I can't be sure that what some random data-structure library does is safe to just use on top of the stm library... I would prefer something working on a higher level. (ready data-structure + stm solution) – Almemar 8/11, 2009 at 20:31

Well, I think (and there are a number of study) that current STM isn't faster than lock-free and mutex-based code. It is obvious: STM requires on-line data conflict checking. However, such conflict checking in pure software requires very huge time overheads. Currently, only Sun's ROCK processor supports a limited form of STM (Best-effort HTM with STM) by hardware. No x86 CPUs currently support TM in hardware. In short, STM is just slow.

In my opinion, you'd better just using a concurrent hash table. For example, you can find concurrent_hash_map in Intel TBB. Here is the link of TBB Manual. Oh, but it's C++, not C. But, I do believe you can (though it might take significant work) translate C++-based such hash table to C code. Intel TBB is open source.

Also, I think such highly concurrent (usually implemented as lock-free) data structures are not always useful. In some workload pattern, using such data structures is not good. To be sure, I recommend you to write a small micro-benchmark for two versions of hash tables: (1) lock-free and (2) lock-based. Also, please keep in mind the workload patterns for such micro-benchmark should be close to real one. An example could be found in here.

Rhizomorphous answered 8/11, 2009 at 20:41 Comment(5)

I know they're sometimes slower. But they're also sometimes easier in a specific solution. I have a big hash that is often read, rarely written to. Also writes take a long time and need a lookup first. With a pool of readers and 1 writer it's a perfect case for STM. Also, you DO NOT need an on-line conflict checking. You can implement many STM-style solutions in a way that reader NEVER waits and NEVER blocks (and writer never waits, although might need to retry) using only the standard CMPXCHG. For me that's one synchronisation every 30 sec -vs- grabbing the read lock every 10ms. – Almemar 8/11, 2009 at 21:35

I understand your problem. It's better to use "lock-free" data structure rather than STM-style. STM always means conflict checking. What about RCU? en.wikipedia.org/wiki/Read-copy-update – Rhizomorphous 8/11, 2009 at 21:38

PS. I know that concurrent_hash_map is supposed to be "lock-less", but they write: "Because having access to an element can block other threads, try to shorten the lifetime of the accessor or const_accessor." Not good enough for me in this case. – Almemar 8/11, 2009 at 21:39

Minjang: That's what I meant (more or less ;) ). From what I understand an RCU trie-based immutable GC'd hash map pretty much meets the definition of an STM hash map (i.e. has exactly the same properties). Or am I missing something? – Almemar 8/11, 2009 at 21:48

STM is just software-level implementation of TM. TM is a speculative approach to handle locks efficiently. By definition, TM is a generalization of lock-free data structures. So, STM hash map actually doesn't make sense to me, and I never heard such terms. It's just wait-free/lock-free. In current x86 architecture, there is no magic that you can do better than a conventional lock-free data structure. I believe RCU-like data structure is the best for read-intensive workloads. – Rhizomorphous 8/11, 2009 at 21:58

TBoost.STM has implemented a hash map in its example.

Jere answered 20/2, 2012 at 9:7 Comment(0)

Recommended topics

Hot tags