What is the function that is responsible for inlining the method?

A MethodInliner is responsible for inlining the method in a semantically correct manner:– if the inlined method expects dynamic information, it first inserts a prologue that does the wrapping of all the parameters.

What is the only thing Jinline can do systematically?

Jinline only takes care of wrapping and unwrapping primitive types and exceptions, which is actually the only thing it can do systematically.

What is the function that is responsible for inlining a method?

For the inlining part, a MethodParser is responsible for parsing a method body and notifying the appropriate Jinlers whenever needed.

What is the main drawback of a runtime MOP?

Compared to static transformation systems – such as macro systems, inlining systems, and compile-time MOPs –, where the link between the modifier and the modified entity is merged at some point, runtime MOPs maintain this link, known as the causal connection link [14,15], at run time, thus enabling dynamic updates of this link at the expense of a certain overhead.

What is the purpose of inlined code?

In addition to this, choosing to inline methods provides us with a natural way to pass dynamic information at run time to the inlined piece of code: all relevant information is packed and passed as argument of the inlined method.

What is the initialization work for the Jinler?

The initialization work in this case simply consists of telling the Jinliner that it should notify the Jinler upon occurrences of constructor sends (1).

(Open Access) Altering Java Semantics via Bytecode Manipulation (2002) | Éric Tanter

Q: What is the purpose of the paper?

The purpose of the work the authors present in this paper is to provide a tool enabling such alterations with the appropriate level of abstraction.

Q: What is the only possible replacement for the factory method?

The only possible replacement is:new Point(1, 2); =⇒ Factory.getPoint(1, 2);The following issues come to light:– First, the name of the instantiated class is not passed as a parameter, which implies that the authors need a method per class (a getPointmethod, a getTriangle method, etc.).

Altering Java Semantics via Bytecode

Manipulation

Eric Tanter

, Marc S´egura-Devillechaise

, Jacques Noy´e

, and Jos´e Piquer

University of Chile, Computer Science Dept.

Avenida Blanco Encalada 2120, Santiago, Chile,

{etanter,jpiquer}@dcc.uchile.cl

Ecole des Mines de Nantes, OCM group

La Chantrerie, 4, rue Alfred Kastler. B.P. 20722,

F-44307 Nantes Cedex 3, France,

{msegura,noye}@emn.fr

Abstract. Altering the semantics of programs has become of major

interest. This is due to the necessity of adapting existing software, for

instance to achieve interoperability between oﬀ-the-shelf components. A

system allowing such alterations should operate at the bytecode level in

order to preserve portability and to be useful for pieces of software whose

source code is not available. Furthermore, working at the bytecode level

should be done while keeping high-level abstractions so that it can be

useful to a wide audience. In this paper, we present Jinline, a tool that

operates at load time through bytecode manipulation. Jinline makes it

possible to inline a method body before, after, or instead of occurrences of

language mechanisms within a method. It provides appropriate high-level

abstractions for ﬁne-grained alterations while oﬀering a good expressive

power and a great ease of use.

1 Introduction

Altering the semantics of programs serves many objectives in software engineer-

ing, related to software adaptation. A particular case of software adaptation,

highlighted by Keller and H¨olzle in [1], is to make several oﬀ-the-shelf com-

ponents interoperable [2]. To this end, Keller and H¨olzle proposed binary

component adaptation (BCA), a tool for performing coarse-grained alterations

on component binaries. However, coarse-grained alterations, usually limited

to modiﬁcations of the interface or of the type hierarchy, may turn out to be

insuﬃcient. Another objective addressed by alteration of program semantics is

that of separation of concerns [3], as emphasized by the work carried out within

the reﬂection community [4,5,6], and more recently, by the emerging paradigm of

aspect-oriented programming (AOP) [7]. In both cases, an important objective

is to separate the development of the functional core of an application from the

implementation of its non-functional concerns, such as persistency, distribution,

or security. The complete application is then obtained by merging the diﬀerent

parts together. Such a merging requires to perform ﬁned-grained alterations

D. Batory, C. Consel, and W. Taha (Eds.): GPCE 2002, LNCS 2487, pp. 283–298, 2002.

 Springer-Verlag Berlin Heidelberg 2002

284

E. Tanter et al.

within method bodies. The purpose of the work we present in this paper is to

provide a tool enabling such alterations with the appropriate level of abstraction.

In Java, portable transformation mechanisms require code rewriting. This

usually automated rewriting can be performed on source code or on bytecode.

The Java community has already developed an impressive set of tools trans-

forming source code: AspectJ [8] to support AOP, Sun’s JavaScope project to

instrument source code, a Dylan-like macro system called Java Syntactic Exten-

der [9] and a class-based macro system, OpenJava [10]. Nevertheless, in many

contexts, expecting source code availability is a mistake: oﬀ-the-shelf compo-

nents usually ship in binary form, and sophisticated distributed systems, like

mobile agent platforms, usually rely on dynamic class loading. Therefore, while

still interesting in themselves, these tools are not generally applicable. This is

why we claim that transformation tools should operate on bytecode.

Available transformation tools based on bytecode rewriting are usually

inadequate for a wide and generic use. First, most of these tools oﬀer bytecode-

level abstractions. This is inadequate if the tool has to be used by a wide

audience, since precise knowledge of the bytecode language is required. This

point has been addressed by Javassist [11], which oﬀers high-level abstrac-

tions. Though targeted to structural reﬂection, Javassist can be used to

perform ﬁne-grained alterations. However, in this domain, Javassist suﬀers from

a limited expressive power and a lack of generality, as we will discuss in section 2.

In this perspective, we propose Jinline, a tool for altering Java semantics.

Jinline operates on bytecode, keeps high-level abstractions, oﬀers a good ex-

pressive power and generality. To summarize, Jinline makes it possible to inline

a method body before, after, or instead of a language mechanism occurrence

within a method.

Traditionally, inlining means replacing a call to a function by an instance of

the function body [12]. What Jinline actually does is inserting code or replacing

code. The new code is deﬁned by a method and therefore the inserted code is

conceptually a method call, except that Jinline actually inlines this new method.

Hence, although Jinline cannot be qualiﬁed as an inliner, most of its job consists

of inlining pieces of code into others. In addition to this, Jinline provides two

diﬀerent sets of information:

1. Static information at inlining time. Jinline provides static information

that can be used to drive the inlining process. For instance, in the case of

a message send, it will provide the signature of the invoked method. This

helps to decide whether inlining should occur or not, which method should

be inlined and where (before, after, instead of).

2. Dynamic information at run time. Jinline ensures that the inlined

method will receive as arguments all the useful dynamic information that

By language mechanisms we refer to the standard mechanisms oﬀered by the lan-

guage, such as message sending, accessing ﬁelds, casting, etc. A language mechanism

occurrence is a particular instance of a language mechanism in a piece of code.

Altering Java Semantics via Bytecode Manipulation 285

can be extracted. This point is very important since it makes the tool partic-

ularly suited for implementing generic extensions, as we will exemplify in the

rest of this paper. In the case of a message send, the dynamic information

includes the method invoked, the method from which the invocation is done,

references to the caller and the callee, in addition to the actual arguments

of the invocation.

Applications of such an alteration tool are manifold. We have already

mentioned the issue of oﬀ-the-shelf components integration. Two of the authors

are actually working on an open implementation of a run-time MetaObject

Protocol (MOP), Reﬂex [13]. Many transformers for the Reﬂex framework can

be implemented with Jinline, thus increasing its expressiveness with caller-side

interceptions. Jinline is also particularly adapted for implementing custom

extensions and AOP systems.

The rest of this paper is organized as follows: in section 2, we will review

the diﬀerent Java bytecode manipulation tools and relate our work to them.

In section 3 we will present Jinline, its interface to the outside world and an

overview of its architecture. In section 4 we will present a simple example of

applying Jinline. Section 5 will conclude the paper.

2 An Overview of Bytecode Manipulation Tools

One way of modifying a program is to alter its semantics by using reﬂection [14,

15]. However, the Java programming language does not provide support for

altering the semantics of programs. Since the class model is closed (class Class

and all the classes of the Reﬂection API are ﬁnal), it is not possible to reﬁne

the semantics of language mechanisms by specializing the class model, as can

be done in Smalltalk [16]. Therefore, alterations have to be implemented either

at the virtual machine level, like in VM-based run-time metaobject protocols

like Metaxa [17], Guaran´a [18] and Iguana/J [19] thus sacriﬁcing portability, or

at the code level, through code transformation. We have already discarded the

possibility of operating on source code for reasons of availability of the source

code itself. This is why a number of propositions have been made to transform

bytecode. These propositions diﬀer in terms of the abstraction level of the entities

a user is expected to program with, and in the expressive power or granularity

of the transformations permitted.

2.1 Transformations Based on Bytecode-Level Abstractions

A number of extensions allow programmers to transform classes at load time at

the expense of manipulating abstractions representing bytecode.

BIT [20] suﬀers from a too restricted scope: it only oﬀers the possibility to

insert before/after methods, but does not address transformation of interfaces

or method bodies.

286

E. Tanter et al.

There are several general-purpose implementations of bytecode manipulation

available: BCEL [21], JikesBT [22], and JOIE [23]. All of them translate the

class ﬁle data structure into an intermediate representation, allow the user to

perform modiﬁcations and to ﬁnally regenerate a valid class ﬁle data structure

from the transformed intermediate representation. The bytecode-level API of

Javassist [11] could ﬁt into this category although bytecode instructions are not

reiﬁed: the programmer is just provided with an iterator over a sequence of

bytes. The main strength of these general-purpose extensions is their expressive

power, since they are able to express anything that can be written in bytecode.

However, their main drawback is to be low-level and therefore diﬃcult to use.

2.2 Transformations Based on Source-Level Abstractions

Metaobject protocols (MOPs) are a natural framework for reifying high-level

language entities [24]. Run-time MOPs are an approach to enable the run-time

alteration of program semantics. Compared to static transformation systems –

such as macro systems, inlining systems, and compile-time MOPs –, where the

link between the modiﬁer and the modiﬁed entity is merged at some point, run-

time MOPs maintain this link, known as the causal connection link [14,15], at

run time, thus enabling dynamic updates of this link at the expense of a certain

overhead.

Reﬂex [13] and Kava [25] are run-time MOPs for Java that rely on load-

time insertion of pieces of code (hooks) to transfer control to the metalevel

at run time. These systems are bound to behavioral reﬂection, which is the

ability of dynamically altering the behavior of objects. This approach is in fact

complementary to static code transformation approaches in cases where dynamic

adaptability or instance-speciﬁc alterations are needed (see for instance [26]).

BCA [1] is a bytecode modiﬁcation tool with a high-level interface, but it

only deals with external interfaces and class hierarchies, ignoring method bod-

ies. Javassist [11] is a mature tool for load-time structural reﬂection in Java.

Structural reﬂection is the ability of a program to alter the deﬁnitions of data

structures such as classes and methods. With Javassist, the transformations that

can be made are at the granularity of class or members. The main goal achieved

by Javassist is a high-level and easy-to-use interface. To allow ﬁner-grained trans-

formations, Javassist has recently made public its bytecode-level API, which we

mentioned in subsection 2.1. Recall that it lacks a concrete reiﬁcation of bytecode

instructions. To bridge the gap between its high-level and low-level APIs, Javas-

sist oﬀers a code converter to instrument method bodies through a high-level

interface.

2.3 Limitations of the Code Converter of Javassist

The code converter of Javassist – the closest tool to our proposal – oﬀers a simple

high-level API to alter method bodies. This API allows inserting before/after

methods, redirecting method invocations or ﬁeld accesses, and replacing cre-

ations. We claim that its expressiveness is limited and that it lacks generality.

Altering Java Semantics via Bytecode Manipulation 287

Its limited expressiveness is in fact not that much an issue since it can actually

be upgraded, and also, in many cases, it is suﬃcient to alter such mechanisms

as method invocations, ﬁeld accesses and object creations. A more annoying

problem is the limitation about the possible transformations: for instance, a ﬁeld

access can only be replaced by a static method call, and a method invocation

can only be replaced by another method invocation on the same object with the

same parameters.

But all in all, the major drawback of the code converter lies in the fact that

it is not well-suited to designing generic solutions. Since Javassist lacks semantic

information in the process of modifying bytecode (remember that Javassist

does not reify bytecode instructions as such), the possible transformations

are limited. The code converter does not perform any reiﬁcation of what is

actually occurring. For instance, an object creation can be replaced by a method

invocation, but this method will not receive as argument the name of the class

that was to be instantiated: it has to be speciﬁc to a type. This limitation is

common to all transformations.

To illustrate this limitation, consider the following simple example: we want

to set up a factory pattern [27] for instantiating any class in an existing appli-

cation. That is to say, instead of calling directly new, we want to call a factory

method. Designed with generality and extensibility in mind, the factory method

would be:

public Object getInstance(String classname, Object[] args){...}

Then we want to transform all the instantiations so that they call this unique

factory method, for instance:

new Point(1, 2); =⇒ Factory.getInstance(”Point”, [1, 2]);

This is not feasible with the code converter. The only possible replacement is:

new Point(1, 2); =⇒ Factory.getPoint(1, 2);

The following issues come to light:

– First, the name of the instantiated class is not passed as a parameter, which

implies that we need a method per class (a getPoint method, a getTriangle

method, etc.).

– Second, the arguments are not packed, which means we need a method

per set of parameters (a method getPoint(int, int), another method

getPoint(Point), etc.).

It is easy to see that such an approach is not applicable to real world cases.

What is needed is a tool that can systematically provide runtime information

in a cost-eﬀective manner to the new inserted code. In addition to this, more

ﬂexibility with respect to what code can be inserted is highly appreciable. This

is exactly what Jinline is about.

Altering Java Semantics via Bytecode Manipulation

Figures

Citations

An overview of AspectJ

An easy-to-use toolkit for efficient Java bytecode translators

Advanced Java bytecode instrumentation

Partial behavioral reflection: spatial and temporal selection of reification

Web cache prefetching as an aspect: towards a dynamic-weaving based solution

References

Design Patterns: Elements of Reusable Object-Oriented Software

Aspect-oriented programming

Smalltalk-80: The Language and its Implementation

An overview of AspectJ

An Overview of AspectJ

Related Papers (5)

Load-time structural reflection in Java

An Overview of AspectJ

The Java Virtual Machine Specification

Exploiting hardware performance counters with flow and context sensitive profiling

Aspect-oriented programming

Frequently Asked Questions (9)

Q1. What is the function that is responsible for inlining the method?

Q2. What is the only thing Jinline can do systematically?

Q3. What is the function that is responsible for inlining a method?

Q4. What is the purpose of the paper?

Q5. What is the main drawback of a runtime MOP?

Q6. What is the main drawback of Javassist?

Q7. What is the purpose of inlined code?

Q8. What is the only possible replacement for the factory method?

Q9. What is the initialization work for the Jinler?