Chapter 1. Reactive Programming with RxJava
RxJava is a specific implementation of reactive programming for Java and Android that is influenced by functional programming. It favors function composition, avoidance of global state and side effects, and thinking in streams to compose asynchronous and event-based programs. It begins with the observer pattern of producer/consumer callbacks and extends it with dozens of operators that allow composing, transforming, scheduling, throttling, error handling, and lifecycle management.
RxJava is a mature open source library that has found broad adoption both on the server and on Android mobile devices. Along with the library, an active community of developers has built up around RxJava and reactive programming to contribute to the project, speak, write, and help one another.
This chapter will provide an overview of RxJava: what it is and how it works. The rest of this book will take you through all of the details of how to use and apply it in your applications. You can begin reading this book with no prior experience with reactive programming; we will start at the beginning and take you through the concepts and practices of RxJava so that you can apply its strengths to your use cases.
Reactive Programming and RxJava
Reactive programming is a general programming term that is focused on reacting to changes, such as data values or events. It can be, and often is, done imperatively. A callback is an approach to reactive programming done imperatively. A spreadsheet is a great example of reactive programming: cells dependent on other cells automatically "react" when those other cells change.
On today's computers, everything ends up being imperative at some point as it hits the operating system and hardware. The computer must be told explicitly what needs to be done and how to do it. Humans do not think like CPUs and related systems, so we add abstractions. Reactive-functional programming is an abstraction, just like our higher-level imperative programming idioms are abstractions for the underlying binary and assembly instructions. The fact that everything ends up imperative is important to remember and understand because it helps us with the mental model of what reactive-functional programming is addressing and how it ultimately executes; there is no magic.
Reactive-functional programming therefore is an approach to programming (an abstraction on top of imperative systems) that allows us to program asynchronous and event-driven use cases without having to think like the computer itself and imperatively define the complex interactions of state, particularly across thread and network boundaries. Not having to think like the computer is a useful trait when it comes to asynchrony and event-driven systems, because concurrency and parallelism are involved, and these are very challenging characteristics to use correctly and efficiently. Within the Java community, the books Java Concurrency in Practice by Brian Goetz and Concurrent Programming in Java by Doug Lea (Addison-Wesley), and forums such as "Mechanical Sympathy," are representative of the depth, breadth, and complexity of mastering concurrency. My interactions with experts from these books, forums, and communities since I started using RxJava have convinced me even more than before of how difficult it actually is to write high-performance, efficient, scalable, and correct concurrent software. And we haven't even brought in distributed systems, which take concurrency and parallelism to another level.
So, the short answer to what reactive-functional programming is solving is concurrency and parallelism. More colloquially, it is solving callback hell, which results from addressing reactive and asynchronous use cases in an imperative style. Reactive programming such as that implemented by RxJava is influenced by functional programming and uses a declarative approach to avoiding the typical pitfalls of reactive-imperative code.
When You Need Reactive Programming
Reactive programming is useful in scenarios such as the following:
- Processing user events such as mouse movement and clicks, keyboard typing, GPS signals changing over time as users move with their device, device gyroscope signals, touch events, and so on.
- Responding to and processing any and all latency-bound IO events from disk or network, given that IO is inherently asynchronous (a request is made, time passes, a response might or might not be received, which then triggers further work).
- Handling events or data pushed at an application by a producer it cannot control (system events from a server, the user events just mentioned, signals from hardware, events triggered by the analog world from sensors, and so on).
Now, if the code in question is handling only one event stream, reactive-imperative programming with a callback is going to be fine, and bringing in reactive-functional programming is not going to give you much benefit. You can even have hundreds of different event streams; if they are all completely independent of one another, imperative programming is not likely to be a problem. In such straightforward use cases, imperative approaches are going to be the most efficient because they eliminate the abstraction layer of reactive programming and stay closer to that for which current operating systems, languages, and compilers are optimized.
If your program is like most though, you need to combine events (or asynchronous responses from functions or network calls), have conditional logic interacting between them, and must handle failure scenarios and resource cleanup on any and all of them. This is where the reactive-imperative approach begins to dramatically increase in complexity and reactive-functional programming begins to shine. A nonscientific view I have come to hold is that reactive-functional programming has an initially higher learning curve and barrier to entry, but that the ceiling for complexity is far lower than with reactive-imperative programming.
Hence, this is where the tagline for Reactive Extensions (Rx) in general and RxJava specifically comes from: "a library for composing asynchronous and event-based programs." RxJava is a concrete implementation of reactive programming principles influenced by functional and dataflow programming. There are different approaches to being "reactive," and RxJava is but one of them. Let's dig into how it works.
How RxJava Works
Central to RxJava is the Observable type that represents a stream of data or events. It is intended for push (reactive) but can also be used for pull (interactive). It is lazy rather than eager. It can be used asynchronously or synchronously. It can represent 0, 1, many, or infinite values or events over time.
That's a lot of buzzwords and details, so let's unpack it. You'll get the full details in "Anatomy of rx.Observable".
Push versus Pull
The entire point of RxJava being reactive is to support push, so the Observable and related Observer type signatures support events being pushed at it. This in turn generally is accompanied by asynchrony, which is discussed in the next section. But the Observable type also supports an asynchronous feedback channel (also sometimes referred to as async-pull or reactive-pull) as an approach to flow control or backpressure in async systems. A later section in this chapter will address flow control and how this mechanism fits in.
To support receiving events via push, an Observable/Observer pair connect via subscription. The Observable represents the stream of data and can be subscribed to by an Observer (which you'll learn more about in "Capturing All Notifications by Using Observer<T>"):

    interface Observable<T> {
        Subscription subscribe(Observer s)
    }

Upon subscription, the Observer can have three types of events pushed to it:
- Data via the onNext() function
- Errors (exceptions or throwables) via the onError() function
- Stream completion via the onCompleted() function

    interface Observer<T> {
        void onNext(T t)
        void onError(Throwable t)
        void onCompleted()
    }
The onNext() method might never be called or might be called once, many, or infinite times. The onError() and onCompleted() methods are terminal events, meaning that only one of them can be called, and only once. When a terminal event is called, the Observable stream is finished and no further events can be sent over it. Terminal events might never occur if the stream is infinite and does not fail.
As will be shown in "Flow Control" and "Backpressure", there is an additional type of signature to allow interactive pull:

    interface Producer {
        void request(long n)
    }

This is used with a more advanced Observer called Subscriber (with more details given in "Controlling Listeners by Using Subscription and Subscriber<T>"):

    interface Subscriber<T> implements Observer<T>, Subscription {
        void onNext(T t)
        void onError(Throwable t)
        void onCompleted()
        ...
        void unsubscribe()
        void setProducer(Producer p)
    }

The unsubscribe function, as part of the Subscription interface, is used to allow a subscriber to unsubscribe from an Observable stream. The setProducer function and Producer type are used to form a bidirectional communication channel between the producer and consumer that is used for flow control.
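Here is a minimal sketch (not taken from the book's examples) of how that reactive-pull channel is typically used from the consumer side: the Subscriber overrides onStart() and calls request(n) to ask for items one at a time instead of letting them be pushed unbounded. In this sketch, someObservable stands in for any backpressure-aware source and process() is a hypothetical per-item operation.

    Subscriber<String> slowConsumer = new Subscriber<String>() {
        @Override
        public void onStart() {
            request(1); // ask the Producer for the first item only
        }

        @Override
        public void onNext(String item) {
            process(item); // hypothetical per-item work
            request(1);    // ask for the next item once this one is handled
        }

        @Override
        public void onError(Throwable t) {
            t.printStackTrace();
        }

        @Override
        public void onCompleted() {
            System.out.println("done");
        }
    };

    someObservable.subscribe(slowConsumer);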
Async versus Sync
Generally, an Observable is going to be asynchronous, but it doesn't need to be. An Observable can be synchronous, and in fact defaults to being synchronous. RxJava never adds concurrency unless it is asked to do so. A synchronous Observable would be subscribed to, emit all data using the subscriber's thread, and complete (if finite). An Observable backed by blocking network I/O would synchronously block the subscribing thread and then emit via onNext() when the blocking network I/O returned.
For example, the following is completely synchronous:

    Observable.create(s -> {
        s.onNext("Hello World!");
        s.onCompleted();
    }).subscribe(hello -> System.out.println(hello));

You will learn more about Observable.create in "Mastering Observable.create()" and Observable.subscribe in "Subscribing to Notifications from Observable".
Now, as you are probably thinking, this is generally not the desired behavior of a reactive system, and you are right. It is bad form to use an Observable with synchronous blocking I/O (if blocking I/O needs to be used, it should be made asynchronous with threads). However, sometimes it is appropriate to synchronously fetch data from an in-memory cache and return it immediately. The "Hello World" example shown previously does not need concurrency, and in fact will be far slower if asynchronous scheduling is added to it. Thus, the criterion that is generally important is whether the Observable event production is blocking or nonblocking, not whether it is synchronous or asynchronous. The "Hello World" example is nonblocking because it never blocks a thread, so it is a correct (though superfluous) use of an Observable.
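As a minimal sketch of that advice (assuming a hypothetical blocking call getDataFromNetworkBlocking()), the blocking work can be wrapped lazily and moved onto a scheduler with subscribeOn so the subscribing thread is never blocked:

    Observable<String> asyncData = Observable
            .fromCallable(() -> getDataFromNetworkBlocking()) // blocking call, wrapped lazily
            .subscribeOn(Schedulers.io());                    // run it on an I/O thread, not the caller's

    asyncData.subscribe(data -> System.out.println(data));

Schedulers are covered properly in "Multithreading in RxJava"; the point here is only that the blocking detail is hidden inside the Observable rather than leaking to its subscribers.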
The RxJava Observable is purposefully agnostic with regard to async versus sync, and whether concurrency exists or where it comes from. This is by design and allows the implementation of the Observable to decide what is best. Why might this be useful?
First of all, concurrency can come from multiple places, not just thread pools. If the data source is already async because it is on an event loop, RxJava should not add more scheduling overhead or force a particular scheduling implementation. Concurrency can come from thread pools, event loops, actors, and so on. It can be added, or it can originate from the data source. RxJava is agnostic with respect to where the asynchrony originates.
Second, there are two good reasons to use synchronous behavior, which we'll look at in the following subsections.
In-memory data
If data exists in a local in-memory cache (with constant microsecond/nanosecond lookup times), it does not make sense to pay the scheduling cost to make it asynchronous. The Observable can just fetch the data synchronously and emit it on the subscribing thread, as shown here:

    Observable.create(s -> {
        s.onNext(cache.get(SOME_KEY));
        s.onCompleted();
    }).subscribe(value -> System.out.println(value));

This scheduling choice is powerful when the data might or might not be in memory. If it is in memory, emit it synchronously; if it's not, perform the network call asynchronously and return the data when it arrives. This choice can reside conditionally within the Observable:

    // pseudo-code
    Observable.create(s -> {
        T fromCache = getFromCache(SOME_KEY);
        if (fromCache != null) {
            // emit synchronously
            s.onNext(fromCache);
            s.onCompleted();
        } else {
            // fetch asynchronously
            getDataAsynchronously(SOME_KEY)
                .onResponse(v -> {
                    putInCache(SOME_KEY, v);
                    s.onNext(v);
                    s.onCompleted();
                })
                .onFailure(exception -> {
                    s.onError(exception);
                });
        }
    }).subscribe(s -> System.out.println(s));
Synchronous computation (such as operators)
The more common reason for remaining synchronous is stream composition and transformation via operators. RxJava mostly uses its large API of operators to manipulate, combine, and transform data, such as map(), filter(), take(), flatMap(), and groupBy(). Most of these operators are synchronous, meaning that they perform their computation synchronously within onNext() as the events pass by.
These operators are synchronous for performance reasons. Take this as an example:

    Observable<Integer> o = Observable.create(s -> {
        s.onNext(1);
        s.onNext(2);
        s.onNext(3);
        s.onCompleted();
    });

    o.map(i -> "Number " + i)
     .subscribe(s -> System.out.println(s));

If the map operator defaulted to being asynchronous, each number (1, 2, 3) would be scheduled onto a thread where the string concatenation would be performed ("Number " + i). This is very inefficient and generally has nondeterministic latency due to scheduling, context switching, and so on.
The important thing to understand here is that most Observable function pipelines are synchronous (unless a specific operator needs to be async, such as timeout or observeOn), whereas the Observable itself can be async. These topics receive more in-depth treatment in "Declarative Concurrency with observeOn()" and "Timing Out When Events Do Not Occur".
The following example demonstrates this mixture of sync and async:

    Observable.create(s -> {
        ... async subscription and data emission ...
    })
    .doOnNext(i -> System.out.println(Thread.currentThread()))
    .filter(i -> i % 2 == 0)
    .map(i -> "Value " + i + " processed on " + Thread.currentThread())
    .subscribe(s -> System.out.println("SOME VALUE => " + s));

    System.out.println("Will print BEFORE values are emitted")

In this example, the Observable is async (it emits on a thread different from that of the subscriber), so subscribe is nonblocking, and the println at the end will output before events are propagated and the "SOME VALUE =>" output is shown.
However, the filter() and map() functions are synchronously executed on the calling thread that emits the events. This is generally the behavior we want: an asynchronous pipeline (the Observable and composed operators) with efficient synchronous computation of the events.
Thus, the Observable type itself supports both sync and async concrete implementations, and this is by design.
Concurrency and Parallelism
Individual Observable streams permit neither concurrency nor parallelism. Instead, they are achieved via composition of async Observables.
Parallelism is simultaneous execution of tasks, typically on different CPUs or machines. Concurrency, on the other hand, is the composition or interleaving of multiple tasks. If a single CPU has multiple tasks (such as threads) on it, they are executing concurrently but not in parallel, by "time slicing." Each thread gets a portion of CPU time before yielding to another thread, even if a thread has not yet finished.
Parallel execution is concurrent by definition, but concurrency is not necessarily parallelism. In practice, this means being multithreaded is concurrency, but parallelism occurs only if those threads are being scheduled and executed on different CPUs at the exact same time. Thus, generically we speak about concurrency and being concurrent, but parallelism is a specific form of concurrency.
The contract of an RxJava Observable is that events (onNext(), onCompleted(), onError()) can never be emitted concurrently. In other words, a single Observable stream must always be serialized and thread-safe. Each event can be emitted from a different thread, as long as the emissions are not concurrent. This means no interleaving or simultaneous execution of onNext(). If onNext() is still being executed on one thread, another thread cannot begin invoking it again (interleaving).
Here's an example of what's okay:

    Observable.create(s -> {
        new Thread(() -> {
            s.onNext("one");
            s.onNext("two");
            s.onNext("three");
            s.onNext("four");
            s.onCompleted();
        }).start();
    });

This code emits data sequentially, so it complies with the contract. (Note, however, that it is generally advised not to start a thread like that inside an Observable. Use schedulers instead, as discussed in "Multithreading in RxJava".)
Here's an example of code that is illegal:

    // DO NOT DO THIS
    Observable.create(s -> {
        // Thread A
        new Thread(() -> {
            s.onNext("one");
            s.onNext("two");
        }).start();

        // Thread B
        new Thread(() -> {
            s.onNext("three");
            s.onNext("four");
        }).start();

        // ignoring need to emit s.onCompleted() due to race of threads
    });
    // DO NOT DO THIS

This code is illegal because it has two threads that can both invoke onNext() concurrently. This breaks the contract. (Also, it would need to safely wait for both threads to complete before calling onCompleted(), and as mentioned before, it is generally a bad idea to manually start threads like this.)
So, how do you take advantage of concurrency and/or parallelism with RxJava? Composition.
A single Observable stream is always serialized, but each Observable stream can operate independently of the others, and thus concurrently and/or in parallel. This is why merge and flatMap end up being so commonly used in RxJava: to compose asynchronous streams together concurrently. (You can learn more about the details of merge and flatMap in "Wrapping Up Using flatMap()" and "Treating Several Observables as One Using merge()".)
Here is a contrived example showing the mechanics of two asynchronous Observables running on separate threads and merged together:

    Observable<String> a = Observable.create(s -> {
        new Thread(() -> {
            s.onNext("one");
            s.onNext("two");
            s.onCompleted();
        }).start();
    });

    Observable<String> b = Observable.create(s -> {
        new Thread(() -> {
            s.onNext("three");
            s.onNext("four");
            s.onCompleted();
        }).start();
    });

    // this subscribes to a and b concurrently,
    // and merges into a third sequential stream
    Observable<String> c = Observable.merge(a, b);

Observable c will receive items from both a and b, and due to their asynchrony, three things occur:
- "one" will appear before "two"
- "three" will appear before "four"
- The order between one/two and three/four is unspecified
So why not just allow onNext() to be invoked concurrently?
Primarily because onNext() is meant for us humans to use, and concurrency is hard. If onNext() could be invoked concurrently, it would mean that every Observer would need to code defensively for concurrent invocation, even when not expected or wanted.
A second reason is that some operations just aren't possible with concurrent emission; for example, scan and reduce, which are common and important behaviors. Operators such as scan and reduce require sequential event propagation so that state can be accumulated on streams of events that are not both associative and commutative. Allowing concurrent Observable streams (with concurrent onNext()) would limit the types of events that can be processed and require thread-safe data structures.
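As a small illustration (not from the original text) of why sequential emission matters, the running sum that scan() accumulates below depends on seeing events one at a time and in order:

    Observable.just(1, 2, 3, 4)
            .scan((acc, next) -> acc + next)            // accumulates state across events
            .subscribe(sum -> System.out.println(sum)); // prints 1, 3, 6, 10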
Note
The Java 8 Stream type supports concurrent emission. This is why java.util.stream.Stream requires reduce functions to be associative: they must support concurrent invocation on parallel streams. The documentation of the java.util.stream package about parallelism, ordering (related to commutativity), reduction operations, and associativity further illustrates the complexity of the same Stream type permitting both sequential and concurrent emission.
A third reason is that performance is affected by synchronization overhead, because all observers and operators would need to be thread-safe, even if most of the time data arrives sequentially. Despite the JVM often being good at eliminating synchronization overhead, it is not always possible (especially with nonblocking algorithms using atomics), so this ends up being a performance tax not needed on sequential streams.
Additionally, it is often slower to do generic fine-grained parallelism. Parallelism typically needs to be done coarsely, such as in batches of work, to make up for the overhead of switching threads, scheduling work, and recombining. It is far more efficient to synchronously execute on a single thread and take advantage of the many memory and CPU optimizations for sequential computation. On a List or array, it is quite easy to have reasonable defaults for batched parallelism, because all the items are known up front and can be split into batches (though even then it is often faster to just process the full list on a single CPU unless the list is very large, or the compute per item is significant). A stream, however, does not know the work ahead of time; it just receives data via onNext() and therefore cannot automatically chunk the work.
In fact, prior to RxJava v1, a .parallel(Function f) operator was added to try to behave like java.util.stream.Stream.parallel(), because that was considered a nice convenience. It was done in a way that did not break the RxJava contract, by splitting a single Observable into many Observables that each executed in parallel and were then merged back together. However, it ended up being removed from the library prior to v1 because it was very confusing and almost always resulted in worse performance. Adding computational parallelism to a stream of events almost always needs to be reasoned about and tested. Perhaps a ParallelObservable could make sense, for which the operators are restricted to a subset that assumes associativity, but in the years of RxJava being used, it has never ended up being worth the effort, because composition with merge and flatMap provides effective building blocks to address the use cases.
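For reference, here is a hedged sketch of what that composition approach typically looks like: each item's work is wrapped in its own Observable, pushed onto a scheduler with subscribeOn, and flatMap merges the results back into a single stream. The contract holds because each inner Observable is still serialized; processItem() is a hypothetical expensive per-item function.

    Observable.range(1, 100)
            .flatMap(item ->
                    Observable.just(item)
                            .subscribeOn(Schedulers.computation()) // each inner Observable is scheduled independently
                            .map(i -> processItem(i)))             // hypothetical per-item work
            .subscribe(result -> System.out.println(result));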
Chapter 3 will teach how to use operators to compose Observables to benefit from concurrency and parallelism.
Lazy versus Eager
The Observable type is lazy, meaning it does nothing until it is subscribed to. This differs from an eager type such as a Future, which when created represents active work. Laziness allows composing Observables together without data loss due to race conditions and without resorting to caching. With a Future, this isn't a concern, because the single value can be cached, so if the value is delivered before composition, the value can still be retrieved. With an unbounded stream, an unbounded buffer would be required to provide the same guarantee. Thus, the Observable is lazy and will not start until subscribed to, so that all composition can be done before data starts flowing.
In practice, this means two things:
- Subscription, not construction, starts work

  Due to the laziness of an Observable, creating one does not actually cause any work to happen (ignoring the "work" of allocating the Observable object itself). All it does is define what work should be done when it is eventually subscribed to. Consider an Observable defined like this:

      Observable<T> someData = Observable.create(s -> {
          getDataFromServerWithCallback(args, data -> {
              s.onNext(data);
              s.onCompleted();
          });
      });

  The someData reference now exists, but getDataFromServerWithCallback is not yet being executed. All that has happened is that the Observable wrapper has been declared around a unit of work to be performed, the function that lives inside the Observable.

  Subscribing to the Observable causes the work to be done:

      someData.subscribe(s -> System.out.println(s));

  This lazily executes the work represented by the Observable.

- Observables can be reused

  Because the Observable is lazy, it also means a particular instance can be invoked more than once. Continuing with the previous example, this means we can do the following:

      someData.subscribe(s -> System.out.println("Subscriber 1: " + s));
      someData.subscribe(s -> System.out.println("Subscriber 2: " + s));

  Now there will be two separate subscriptions, each calling getDataFromServerWithCallback and emitting events.

  This laziness differs from async types such as Future, where the Future is created to represent work already started. A Future cannot be reused (subscribed to multiple times to trigger work). If a reference to a Future exists, it means work is already happening. You can see in the preceding sample code exactly where the eagerness is; the getDataFromServerWithCallback method is eager because it immediately executes when invoked. Wrapping an Observable around getDataFromServerWithCallback allows it to be used lazily.
This laziness is powerful when doing composition. For example:

    someData.onErrorResumeNext(lazyFallback).subscribe(s -> System.out.println(s));

In this case, the lazyFallback Observable represents work that can be done but will only be done if something subscribes to it, and we only want it subscribed to if someData fails. Of course, eager types can be made lazy by wrapping them in function calls (such as getDataAsFutureA()).
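One way to express that in RxJava (a sketch, assuming a hypothetical eager getDataAsFutureA() that starts work as soon as it is invoked) is Observable.defer, which postpones the eager call until subscription and so makes it usable as a lazy fallback:

    Observable<String> lazyFallback = Observable.defer(() ->
            Observable.from(getDataAsFutureA())); // the eager Future is not even created until subscription

    someData
        .onErrorResumeNext(lazyFallback)
        .subscribe(s -> System.out.println(s));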
Eagerness and laziness each have their strengths and weaknesses, but the RxJava Observable is lazy. Therefore, if you have an Observable, it won't do anything until you subscribe to it.
This topic is discussed in greater detail in "Embracing Laziness".
Duality
An Rx Observable is the async "dual" of an Iterable. By "dual," we mean the Observable provides all the functionality of an Iterable except in the reverse flow of data: it is push instead of pull. The table that follows shows types that serve both push and pull functionality:
| Pull (Iterable) | Push (Observable) |
|---|---|
| T next() | onNext(T) |
| throws Exception | onError(Throwable) |
| returns | onCompleted() |
As per the table, instead of data being pulled out via next() by the consumer, it is pushed to onNext(T) by the producer. Successful termination is signaled via the onCompleted() callback rather than by blocking the thread until all items have been iterated. In place of exceptions being thrown up the callstack, errors are emitted as events to the onError(Throwable) callback.
The fact that it behaves as a dual effectively means anything you can do synchronously via pull with an Iterable and Iterator can be done asynchronously via push with an Observable and Observer. This means that the same programming model can be applied to both!
For example, as of Java 8 an Iterable can be upgraded to support function composition via the java.util.stream.Stream type to work like this:
    // Iterable<String> as Stream<String>
    // that contains 75 strings
    getDataFromLocalMemorySynchronously()
        .skip(10)
        .limit(5)
        .map(s -> s + "_transformed")
        .forEach(System.out::println)

This will retrieve 75 strings from getDataFromLocalMemorySynchronously(), take items 11–15 and ignore the rest, transform the strings, and print them out. (Learn more about operators such as take, skip, and limit in "Slicing and Dicing Using skip(), takeWhile(), and Others".)
An RxJava Observable is used the same way:

    // Observable<String>
    // that emits 75 strings
    getDataFromNetworkAsynchronously()
        .skip(10)
        .take(5)
        .map(s -> s + "_transformed")
        .subscribe(System.out::println)

This will receive 5 strings (15 were emitted, but the first 10 were dropped) and then unsubscribe (ignoring or stopping the rest of the strings that were to be emitted). It transforms and prints the strings just like the previous Iterable/Stream example.
In other words, the Rx Observable allows programming with async data via push just like Streams around Iterables and Lists with synchronous pull.
Cardinality
The Observable supports asynchronously pushing multiple values. This nicely fits into the lower right of the following table, as the async dual of Iterable (or Stream, List, Enumerable, etc.) and the multivalued version of a Future:
| | One | Many |
|---|---|---|
| Synchronous | T getData() | Iterable<T> getData() |
| Asynchronous | Future<T> getData() | Observable<T> getData() |
Note that this section refers to Future generically, using Future.onSuccess(callback) syntax to represent its behavior. Different implementations exist, such as CompletableFuture, ListenableFuture, or the Scala Future. But whatever you do, don't use java.util.Future, which requires blocking to retrieve a value.
So, why might an Observable be valuable instead of just a Future? The most obvious reason is that you are dealing with either an event stream or a multivalued response. The less obvious reason is composition of multiple single-valued responses. Let's look at each of these.
Event stream
An event stream is straightforward. Over time, the producer pushes events at the consumer, as demonstrated here:

    // producer
    Observable<Event> mouseEvents = ...;

    // consumer
    mouseEvents.subscribe(e -> doSomethingWithEvent(e));

This doesn't work very well with a Future:

    // producer
    Future<Event> mouseEvents = ...;

    // consumer
    mouseEvents.onSuccess(e -> doSomethingWithEvent(e));

The onSuccess callback could have received the "latest event," but some questions remain: Does the consumer now need to poll? Will the producer enqueue events? Or will events be lost between each fetch? The Observable is definitely beneficial here. In the absence of Observable, a callback approach would be better than modeling this with a Future.
Multiple values
Multivalued responses are the next use of Observable. Basically, anywhere that a List, Iterable, or Stream would be used, Observable can be used instead:

    // producer
    Observable<Friend> friends = ...

    // consumer
    friends.subscribe(friend -> sayHello(friend));

Now, this can work with a Future, like this:

    // producer
    Future<List<Friend>> friends = ...

    // consumer
    friends.onSuccess(listOfFriends -> {
        listOfFriends.forEach(friend -> sayHello(friend));
    });

So why use the Observable<Friend> approach?
If the list of data to return is small, it probably doesn't matter for performance, and it becomes a subjective choice. If the list is large, though, or the remote data source must fetch different portions of the list from different locations, using the Observable<Friend> approach can be a performance or latency benefit.
The most compelling reason is that items can be processed as they are received rather than waiting for the entire collection to arrive. This is particularly true when different network latencies on the backend can affect each item differently, which is actually fairly common due to long-tail latencies (such as in service-oriented or microservice architectures) and shared data stores. If waiting for the entire collection, the consumer always experiences the maximum latency of the aggregate work done for the collection. If items are returned as an Observable stream, the consumer receives them immediately, and "time to first item" can be significantly lower than the time to the last and slowest item. To make this work, ordering of the stream must be sacrificed so that the items can be emitted in whatever order the server gets them. If order is eventually important to the consumer, a ranking or position can be included in the item data or metadata, and the client can then sort or position the items as needed.
Additionally, it keeps memory usage limited to what is needed per item rather than needing to allocate and collect memory for the entire collection.
Composition
A multivalued Observable type is also useful when composing single-valued responses, such as from Futures.
When merging together multiple Futures, they emit another Future with a single value, such as this:

    CompletableFuture<String> f1 = getDataAsFuture(1);
    CompletableFuture<String> f2 = getDataAsFuture(2);

    CompletableFuture<String> f3 = f1.thenCombine(f2, (x, y) -> {
        return x + y;
    });

That might be exactly what is wanted, and it is actually available in RxJava via Observable.zip (which you'll learn more about in "Pairwise Composing Using zip() and zipWith()"):

    Observable<String> o1 = getDataAsObservable(1);
    Observable<String> o2 = getDataAsObservable(2);

    Observable<String> o3 = Observable.zip(o1, o2, (x, y) -> {
        return x + y;
    });

However, it means waiting until all Futures have completed before emitting anything. Often, it is preferable to emit each returned Future value as it completes. In this case, Observable.merge (or the related flatMap) is preferable. It allows composing the results (even if each is just an Observable emitting one value) into a stream of values that are each emitted as soon as they are ready:

    Observable<String> o1 = getDataAsObservable(1);
    Observable<String> o2 = getDataAsObservable(2);

    // o3 is now a stream of o1 and o2 that emits each item without waiting
    Observable<String> o3 = Observable.merge(o1, o2);
Single
Now, despite the Rx Observable being great at handling multivalued streams, the simplicity of a single-valued representation is very nice for API design and consumption. Additionally, basic request/response behavior is extremely common in applications. For this reason, RxJava provides a Single type, which is a lazy equivalent of a Future. Think of it as a Future with two benefits: first, it is lazy, so it can be subscribed to multiple times and easily composed, and second, it fits the RxJava API, so it can easily interact with an Observable.
For example, consider these accessors:

    public static Single<String> getDataA() {
        return Single.<String>create(o -> {
            o.onSuccess("DataA");
        }).subscribeOn(Schedulers.io());
    }

    public static Single<String> getDataB() {
        return Single.just("DataB")
                .subscribeOn(Schedulers.io());
    }

These can then be used and optionally composed like this:

    // merge a & b into an Observable stream of 2 values
    Observable<String> a_merge_b = getDataA().mergeWith(getDataB());

Note how two Singles are merged into an Observable. This could result in an emission of [A, B] or [B, A], depending on which completes first.
Going back to the previous example, we can now use Single instead of Observable to represent the data fetches, but still merge them into a stream of values:

    // Observable<String> o1 = getDataAsObservable(1);
    // Observable<String> o2 = getDataAsObservable(2);

    Single<String> s1 = getDataAsSingle(1);
    Single<String> s2 = getDataAsSingle(2);

    // o3 is now a stream of s1 and s2 that emits each item without waiting
    Observable<String> o3 = Single.merge(s1, s2);

Using Single instead of Observable to represent a "stream of one" simplifies consumption, because a developer must consider only the following behaviors for the Single type:
- It can respond with an error
- Never respond
- Respond with a success
Compare this with the additional states a consumer must consider with an Observable:
- It can respond with an error
- Never respond
- Respond successfully with no data and terminate
- Respond successfully with a single value and terminate
- Respond successfully with multiple values and terminate
- Respond successfully with one or more values and never terminate (waiting for more data)
By using Single, the mental model is simpler for consuming the API, and only after composition into an Observable happens must a developer consider the additional states. This is often a better place for that to occur, because typically the developer controls that code, whereas the data API is often from a third party.
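As a short consumption sketch (reusing the getDataA() accessor from earlier), subscribing to a Single supplies two callbacks, one for the success value and one for the error; the third behavior, never responding, simply means neither callback is invoked:

    getDataA().subscribe(
            value -> System.out.println("got: " + value),    // called at most once
            error -> System.err.println("failed: " + error)  // or this, never both
    );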
You'll learn more about Single in "Observable versus Single".
Completable
In addition to Single, RxJava also has a Completable type that addresses the surprisingly common use case of having no return type, just the need to represent successful or failed completion. Often Observable<Void> or Single<Void> ends up being used. This is awkward, so Completable came to be, as demonstrated here:

    Completable c = writeToDatabase("data");

This use case is common when doing asynchronous writes for which no return value is expected, but notification of successful or failed completion is needed. The preceding code with Completable is similar to this:

    Observable<Void> c = writeToDatabase("data");

The Completable itself is an abstraction for two callbacks, completion and failure, like this:

    static Completable writeToDatabase(Object data) {
        return Completable.create(s -> {
            doAsyncWrite(data,
                // callback for successful completion
                () -> s.onCompleted(),
                // callback for failure with Throwable
                error -> s.onError(error));
        });
    }
Zero to infinity
Observable can support cardinalities from zero to infinity (which is explored more in "Infinite Streams"). But for simplicity and clarity, Single is an "Observable of One," and Completable is an "Observable of None."
With these newly introduced types, our table ends up looking like this:
| | Zero | One | Many |
|---|---|---|---|
| Synchronous | void doSomething() | T getData() | Iterable<T> getData() |
| Asynchronous | Completable doSomething() | Single<T> getData() | Observable<T> getData() |
Mechanical Sympathy: Blocking versus Nonblocking I/O
Thus far, the argument for the reactive-functional style of programming has primarily been about providing an abstraction over async callbacks to allow more manageable composition. And it is fairly obvious that performing unrelated network requests concurrently rather than sequentially is beneficial to experienced latency, hence the case for adopting asynchrony and needing composition.
But is there an efficiency reason for adopting the reactive approach (either imperative or functional) in how we perform I/O? Are there benefits to using nonblocking I/O, or is blocking I/O threads to wait on a single network request okay? Performance testing I was involved in at Netflix demonstrated that there are objective and measurable efficiency benefits to adopting nonblocking I/O and event loops over thread-per-request blocking I/O. This section provides reasons why this is the case, as well as the data to help you make your own decision.
As referenced in "The Pursuit of Answers", tests were done to compare the performance of blocking and nonblocking I/O with Tomcat and Netty on Linux. Because this type of testing is always controversial and difficult to get right, I'll be very clear that this test is only intended to be relevant for the following:
- Behavior on typical Linux systems being used around 2015/2016
- Java 8 (OpenJDK and Oracle)
- Unmodified Tomcat and Netty as used in typical production environments
- Representative web service request/response workload involving composition of multiple other web services
Considering that context, we learned the following:
- Netty code is more efficient than Tomcat code, allowing it to consume less CPU per request.
- The Netty event-loop architecture reduces thread migrations under load, which improves CPU cache warmth and memory locality, which improves CPU instructions per cycle (IPC), which lowers CPU cycle consumption per request.
- Tomcat code has higher latencies under load due to its thread pool architecture, which involves thread pool locks (and lock contention) and thread migrations to service load.
The following graph best illustrates the difference between the architectures:
Note how the lines diverge as load increases. These are the thread migrations. The most interesting thing I learned was that the Netty application actually becomes more efficient as it is put under load and the threads become "hot" and stick to a CPU core. Tomcat, on the other hand, has a separate thread per request and thus cannot gain this benefit; it retains higher thread migrations because each thread needs to be scheduled for every request.
Netty CPU consumption remains mostly flat through increasing load and actually becomes slightly more efficient as the load is maxed out, as opposed to Tomcat, which becomes less efficient.
The resulting impact on latency and throughput is seen in the following graph:
Despite averages not being very valuable (as opposed to percentiles), this graph shows how both have similar latency under little load but diverge significantly as load increases. Netty is able to better utilize the machine up to higher load with less impact on latency:
This graph of maximum latency was chosen to show how the outliers affect users and system resources. Netty handles load far more gracefully and avoids the worst-case outliers.
The following image shows throughput:
Two strong benefits come out of these findings. First, better latency and throughput mean both a better user experience and lower infrastructure cost. Second, the event-loop architecture is more resilient under load. Instead of falling apart when the load is increased, the machine can be pushed to its limit and handle it gracefully. This is a very compelling argument for large-scale production systems that need to handle unexpected spikes of traffic and remain responsive.
I also found the event-loop architecture easier to operate. It does not[1] require tuning to get optimal performance, whereas the thread-per-request architecture often needs tweaking of thread pool sizes (and subsequently garbage collection) depending on workload.
This is not intended to be an exhaustive study of the topic, but I found this experiment and the resulting data to be compelling evidence for pursuing the "reactive" architecture in the form of nonblocking I/O and event loops. In other words, with hardware, the Linux kernel, and the JVM circa 2015/2016, nonblocking I/O via event loops does have benefits.
Using Netty with RxJava will be further explored later in "Nonblocking HTTP Server with Netty and RxNetty".
Reactive Abstraction
Ultimately RxJava types and operators are just an abstraction over imperative callbacks. However, this abstraction completely changes the coding style and provides very powerful tools for doing async and nonblocking programming. It takes effort to learn and requires a shift in thinking to become comfortable with function composition and thinking in streams, but once you've achieved this, it is a very effective tool alongside our typical object-oriented and imperative programming styles.
The rest of this book takes you through the many details of how RxJava works and how to use it. Chapter 2 explains where Observables come from and how you can consume them. Chapter 3 will guide you through several dozen declarative and powerful transformations.
[1] Beyond perhaps debating whether the number of event loops should be sized at 1x, 1.5x, or 2x the number of cores. I have not found strong differences between these values, though, and generally default to 1x.