JNI wrapper compilation

gbenson Uncategorized Wednesday 28th October 2009Wednesday 28th October 2009 2 Minutes

I now have a version of Shark with a basic implementation JNI wrapper compilation. Sadly I can’t say if it’s faster or not yet because it’s totally unstable!

The problem is this. When HotSpot wishes to compile a normal (interpreted) method, the thread initiating the compile simply adds it to a queue and carries on doing whatever it was it was doing. A separate thread, the compiler thread, loops over this queue, compiling methods one at a time. This means there’s only ever the one thread making LLVM calls, and everything is rosy.

When HotSpot wishes to compile a native (JNI) method, the thread initiating the compile bypasses the queue and does it immediately. It acquires a lock, the adapter hander library lock, so there’s only every one native method being compiled at once, but in the meantime the compiler thread is in all likelihood busy compiling some normal method or another, so there’s two threads making LLVM calls. LLVM doesn’t like this, not LLVM prior to 2.6 at any rate, and even then not without being written with a separate LLVMContext for each thread.

The obvious fix for this is for Shark to acquire a lock before compiling either a normal or a native method, ensuring that only one thread is calling into LLVM at once. This doesn’t work, however, as the compiler thread runs _thread_in_native. The benefit of this is that the compiler thread does not have to halt for safepoints (and the rest of the VM doesn’t have to wait for the compiler thread to halt) but the drawback of this is that threads running _thread_in_native may not own locks. You can’t make the compiler thread run other than _thread_in_native, not without losing the large chunk of the server compiler that Shark shares, and you can’t hack a lock in there anyway (by using a pthread mutex, say, rather than a HotSpot lock) because it’ll deadlock the first time a safepoint occurs when the compiler thread holds the lock (compiling a normal method) and a Java thread is blocking trying to take it (to compile a native one).

I’ve been circling around this issue for a couple of days now, but it looks like the only solution is to require LLVM 2.6 and rearrange everything to use separate contexts.

Published by gbenson

I make things // he/him View all posts by gbenson

Published Wednesday 28th October 2009Wednesday 28th October 2009

4 thoughts on “JNI wrapper compilation”

Xerxes RÃ¥nby says:

Wednesday 28th October 2009 at 12:12

Would it be possible to make the HotSpot thread that initialises the compile to put the native (JNI) method on the queue, like normal method compilation, instead of compiling it immediately; as a quick fix before we have full multi-thread compilation support in Shark using LLVM 2.6?
gbenson says:

Wednesday 28th October 2009 at 12:21

I tried that already ;)
http://mail.openjdk.java.net/pipermail/zero-dev/2009-October/000237.html
Jeffrey Yasskin says:

Monday 2nd November 2009 at 06:11
I don’t know anything about HotSpot or Shark, so sorry if this is naive.

Can _thread_in_native threads call into Java code? If so, I think you can implement your own lock that’ll do this safely:
- Create three new variables. Using C++0x syntax:
  atomic threads_wanting_to_JIT(0);
  atomic compiler_thread_running(true);
  Object llvm_lock
- The JIT is locked whenever compiler_thread_running=true OR llvm_lock is held. , and by Create a global atomic variable, threads_wanting_to_JIT, initialized to 0. While this is 0, the standard compiler thread gets to run.
- When a thread discovers that it needs to JIT a JNI method, it runs:
```
  threads_wanting_to_JIT.fetch_add(1);
  synchronized(llvm_lock) {
    while (compiler_thread_running.load()) { llvm_lock.wait(); }
    // Now we know that the compiler thread isn't calling into the JIT (see below),
    // and at most one non-compiler thread can be here because of the llvm_lock.
    CompileStuff();
    threads_wanting_to_JIT.fetch_sub(1);
    llvm_lock.notifyAll();
  }
```
- Between JIT operations, the compiler thread runs:
```
  if (threads_wanting_to_JIT.load() > 0) {
    enter_java {
      synchronized(llvm_lock) {
        compiler_thread_running.store(false);
        try {
          llvm_lock.notifyAll();
          while (threads_wanting_to_JIT.load() > 0) { llvm_lock.wait(); }
        } finally {
          compiler_thread_running.store(true);  
        }
      }
    }
  }
```
I’m assuming all sequentially-consistent atomics, or Java volatiles or AtomicIntegers. You may be able to relax the memory ordering some, but it’s probably not worth the risk. Of course, go over the above with a fine-toothed comb before actually using it; it’s easy to make mistakes here.

Notwithstanding the above, using separate LLVMContexts will probably be more efficient overall, unless the JNI interfaces need access to anything from the main compilation Context. With separate LLVMContexts, you won’t have to wait for the main compiler to finish before generating JNI interfaces.
Jeffrey Yasskin says:

Monday 2nd November 2009 at 16:49

Oops, I failed at editing above. “threads_wanting_to_JIT” should be an “atomic<int>”; “compiler_thread_running” should be an “atomic<bool>”, and the second bullet should end at the first period.

Published by gbenson

4 thoughts on “JNI wrapper compilation”

Leave a ReplyCancel reply