Subject: Re: C444 JGFSI Reviews
Resent-Date: Fri, 12 Nov 1999 08:45:15 -0500
Resent-From: Geoffrey Fox <gcf@npac.syr.edu>
Resent-To: p_gcf@npac.syr.edu
Date: Mon, 08 Nov 1999 22:29:37 +0900
From: "ISHIZAKI, Kazuaki" <ishizaki@trl.ibm.co.jp>
To: Geoffrey Fox <gcf@npac.syr.edu>

Dear Dr. Geoffrey Fox,

  I have revised my paper. But, as some referees pointed out, my paper
has conversion problems to PDF. Can I send it to you by MS Word format ?

Best Regards,

On Mon, 11 Oct 1999 17:07:43 -0400
Geoffrey Fox <gcf@npac.syr.edu> wrote:

> I enclose 3 Referee reports on your paper.
> We would be pleased to accept it and could you please send me
> a new version before November 5 99
> Please send a memo describing any suggestions of the
> referees that you did not address
> Ignore any aggressive remarks you don't think appropriate but
> please tell me. I trust you!
>
> Thank you for your help in writing and refereeing papers!
>
> Referee 1 ****************************************************************
> Subject: C444 JGSI Review
>
> Paper no.:  C444
> Title: Design, Implementation, and Evaluation of Optimizations in a
> Java Just-In-Time Compiler
>
> Overall recommendation: Publish with minor revisions
>
> Comments to authors:
>
> An interesting and clearly presented paper.  Some of the English,
> although correct, is a little awkward in places.
>
> section 3.3, para 1: the last sentence (Because CSE ...) does not make
> sense.
>
> section 5.1, line 2: small method -> small methods
>
> section 5.2, last para: Most people would argue that Java interfaces are not
> true multiple inheritance.
>
> section 5.2, last para, line 2: only a class -> only one class
>
> section 6, line 7: "Swing 1.0.3 was executed with clicks to all tabs"
> - please explain what this means.
>
> section 6.2 and following. In the performance bar charts it is hard to
> distinguish between the different keys - maybe a problem in converting
> ti PDF? also the key panel overlaps the main panel in some cases.
>
> section 6.7: It would be good to see the performance of the
> applications with *none* of the optimisations applied, in addition to
> what is presented. It is not obvious that the optimisations are completely
> independent.
>
> section 7, para 3, lines 5-6: introduce a runtime test newly ->
> introduce a new runtime test
>
> section 7, para 5, line 2: replacement -> replacements
>
> section 8, line 1: delete full stop after "compiler"
>
> Comments to editor:
> No significant problems.
>
> Referee 2 ************************************************************************
> Subject: C444 JGSI Review
>
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>
> Overall Recommendation: Accept
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
>
> This paper describes a number of optimizations performed by IBM's
> Java JIT compiler, and evaluates their effectiveness.  The paper is
> technically correct and presents results relevant to the Java community.
> The wording could be improved in places; below are some suggestions
> for such improvements.  I appreciate that the authors solicited the
> help of a native English speaker to polish the prose; I would
> recommend a second pass.
>
> Detailed comments
> ~~~~~~~~~~~~~~~~~
>
> Page 2: "Section 0" should read "Section 7"
>
> Page 2: "Inlining of Dynamic method call is applied ..." should read
> "Inlining of Dynamic method calls is applied ..."
>
> Page 2: The authors mention that they introduced a new byte code instruction
> to represent an object's interior pointer.  It is not clear to me why they
> needed to introduce a new byte code, as opposed to just generating the
> appropriate PowerPC instruction.
>
> Page 3: "... was reduced to 6 by the existing method, ..." would be clearer
> as "... was reduced to 6 by Gupta's method, ..."
>
> Example 2: (Keep in mind that I don't know PowerPC assembly.) I did not
> understand the instruction "twi LLT, r7, 1". I would have expected to see
> something like "twi EQ, r7, 0" (i.e. trap if r7 is 0), since the next
> instruction divides r3 by r7.
>
> Page 4: "... the top pointer to the object must be kept in the memory or
> register.  Because CSE generates an interior pointer of an object and the
> garbage collector does not scan an interior pointer of an object because of
> its performance." is rather confusing.  Here's a stab at rephrasing it:
> "... the top pointer to the object must be kept in memory or in a
> register, so that the garbage collector classifies the object as reachable.
> For performance reasons, the collector does not scan the interior pointers
> generated by CSE."
>
> Page 4: "First, ... Finally, ..." don't go together well.  Replace by:
> "First, ... Second, ...."
>
> Page 5: "If all of them tests fail" should read "If all of these tests fail"
>
> Page 5: "then the C runtime library is executed" should read "then a C
> function in the Java runtime is executed"
>
> Page 5: "In this section, we describe two optimizations of method calls:
> first, inlining of static method calls.  Secondly, and then devirtualization
> of dynamic method calls."  could be phrased more succinctly as: "In this
> section, we describe two optimizations of method calls: inlining of static
> method calls, and devirtualization of dynamic method calls."
>
> Page 6: "the constructor is invoked when a new class is created" should read
> "the constructor is invoked when a new object is created".
>
> Page 6: "the JIT compiler inlines small method" should read
> "our JIT compiler inlines small methods".
>
> Page 6: "Dynamic method call is an important feature of object-oriented
> language because of the flexibility, and it is therefore used frequently."
> I would prefer: "Dynamic method call is the defining feature of object-
> oriented languages, and is used frequently."
>
> Page 6: "... to improve the performance of dynamic method call."
>     --> "... to improve the performance of dynamic method calls."
>
> Page 6: "... in which the class hierarchy may change in the future."
>     --> "... in which the class hierarchy may change at runtime."
>
> Page 7: "If CHA finds that only a class implements an interface class, ..."
>     --> "If CHA finds that only one class implements an interface, ..."
>
> Page 7: "inlining of static method call"
>     --> "inlining of static method calls"
>
> Page 7: "raytrace (a body of mtrt)"  What is mtrt?
>
> Page 8/Figure 2: The authors indicate that Figure 2 shows some dark bars
> some white bars.  However, in my copy of the paper all the bars are white
> The same is true for Figures 3 through 7; the text suggests that some bars
> should be non-white, but in my copy, all bars are white.  (This problem
> might have been introduced during the conversion of the paper to PDF ...)
>
> Page 9: "Figure 3 shows the distribution of object types in type inclusion
>          test at runtime."
>     --> "Figure 3 shows the distribution of object types in type inclusion
>          tests at runtime."
>
> Page 9: "In each case, 14%, 28%, and 45% of the original array accesses
> are used."  I'm not sure what the authors are trying to say here. One
> interpretation would be: "For db, 14% or the original array accesses are
> used; for tsgp, 28%; and for tmix, 45%."
>
> Page 9: "In all cases, it optimizes ..."
>     --> "In all cases, our method optimizes ..."
>
> Page 10: "The performance in devirtualizing dynamic method call ..."
>      --> "The performance in devirtualizing dynamic method calls ..."
>
> Page 11: "Therefore, the is still scope to improve ..."
>      --> "Therefore, there is still room to improve ..."
>
> Page 12/Section 6.7:  I would like to see one extra bar labeled NONE in
> Figure 8, showing the baseline performance of the benchmark programs,
> without any of the optimization applied.  Also, the authors should clarify
> if the reported times include the time needed to perform the optimizations,
> or if this overhead was factored out.
>
> Page 12: "Figure 8 thus shows that all optimizations contribute to an
> improvement in the performance."  I would qualify this statement as follows:
> "Figure 8 thus shows that all optimizations contribute to an improvement in
> the performance of some of the programs."
>
> Page 14: "For javac, the inlined codes of both approaches account for 92%."
> I'm not sure what the authors are trying to say.  Maybe: "For javac, the
> inlined codes of both approaches are sufficient to decide 92% of all type
> inclusion tests."
>
> Page 14: "... because direct-binded code is undone ..."
>      --> "... because direct binding code is undone ..."
>
> Page 14: "... we showed that there is still scope for further improvements
>           of the Java performance."
>      --> "... we showed that there is still room for further performance
>           improvements."
>
> ~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~~
> Referee 3 *******************************************************************
> Subject: C444 JGSI Review
>
>
> C444:Design, Implementation, and Evaluation of Optimizations in a Java
>      Just-In-Time Compiler
>
> a)Overall Recommendation
>
>    paper needs changes and should be re-submitted before being accepted
>
> b)Words suitable for authors
>
>   This paper discusses and evaluates optimizations that can be performed
>   by a JIT compiler for Java.  Several of the author's optimization
>   techniques improve on previous work, or propose new solutions to
>   optimizing OO code.  The problem with this paper is that the authors
>   fail to show timing results that measure overall performance improvements
>   that result from their optimizations.  The timing measures they present,
>   compare only times of various combinations of their optimizations.  They
>   need to add a base case with which to compare their optimizations...for
>   each benchmark, they should measure the running time on a JIT with none
>   of their optimizations, and then show % improvements in execution time by
>   applying different optimizations.  Some of their optimization techniques
>   are interesting, and I suspect would result in improvements in total
>   execution time for a least some of these benchmarks, but they never show
>   these numbers and these measures need to be in this paper...adding
> compiler
>   optimizations to a JIT will result in longer running times, they need to
>   show that the increase in compiling time due to their optimizations is
>   recovered in producing faster code.
>
>   A secondary problem is the authors claim in the Related Work section
>   that "Finally, nothing can outperform devirtualization with inlining".
>   This is a strong claim with no reference supporting it.  I would guess
> that
>   polymorphic in-line caching will actually do better in some cases because
>   it really can handle overridden method calls.  Devirtualization only can
>   be applied to virtual method calls that are not polymorphic at run-time.
>   So, if their claim is true (and I think it is too strong), they need to
>   cite a result that shows this, or demonstrate this result in this paper.
>
>   minor changes:  The bar graphs in figures 4-7 are unreadable. In the pdf
>         version, each bar is colored with the same stripe pattern.  As a
>         result, there is no way to tell which bar is which in graphs that show
>         multiple values for each benchmark.
>

---
Kazuaki Ishizaki
IBM Japan, Tokyo Research Laboratory
Mail : ishizaki@trl.ibm.co.jp
TEL: ++81-46-273-4664, FAX: ++81-46-274-4282