程式扎記: 2月 2018

Automatic parallel processing of lists
Most computations that are applied to lists resort to folds. A fold involves applying an operation as many times as there are elements in the list. For very long lists and long-lasting operations, a fold can take a considerable amount of time. Because most computers are now equipped with multicore processors (if not multiple processors), you may be tempted to find a way to make the computer process a list in parallel.

In order to parallelize a fold, you need only one thing (beside a multicore processor, of course): an additional operation allowing you to recompose the results of each parallel computation.

Not all computations can be parallelized
Take the example of a list of integers. Finding the mean of all integers isn’t something you can directly parallelize. You could break the list into four pieces (if you have a computer with four processors) and compute the mean of each sublist. But you wouldn’t be able to compute the mean of the list from the means of the sublists. On the other hand, computing the mean of a list implies computing the sum of all elements and then dividing it by the number of elements. And computing the sum is something that can be easily parallelized by computing the sums of the sublists, and then computing the sum of the sublist sums.

This is a very particular example, where the operation used for the fold (the addition) is the same as the operation used to assemble the sublist results. This isn’t always the case. Take the example of a list of characters that’s folded by adding a character to a String. To assemble the intermediate results, you need a different operation: string concatenation.

Breaking the list into sublists
First, you must break the list into sublists, and you must do this automatically. One important question is how many sublists you should obtain. At first glance, you might think that one sublist for each available processor would be ideal, but this isn’t exactly the case. The number of processors (or, more precisely, the number of logical cores) isn’t the most important factor. There’s a more crucial question: will all sublist computations take the same amount of time? Probably not, but this depends on the type of computation. If you were to divide the list into four sublists because you decided to dedicate four threads to parallel processing, some threads might finish very quickly, while others might have to make a much longer computation. This would ruin the benefit of parallelization, because it might result in most of the computing task being handled by a single thread.

A better solution is to divide the list into a large number of sublists, and then submit each sublist to a pool of threads. This way, as soon as a thread finishes processing a sublist, it’s handed a new one to process. So the first task is to create a method that will divide a list into sublists.

Write a divide(int depth) method that will divide a list into a number of sublists. The list will be divided in two, and each sublist recursively divided in two, with the depth parameter representing the number of recursion steps. This method will be implemented in the List parent class with the following signature:

view plaincopy to clipboardprint?
List<List<A>> divide(int depth)  

Let's first define a new version of the splitAt method that returns a list of lists instead of a Tuple<List, List>. Let’s call this method splitListAt and give it the following signature:

view plaincopy to clipboardprint?
List<List<A>> splitListAt(int i)  

The splitListAt method is an explicitly recursive method made stack-safe through the use of the TailCall class:

view plaincopy to clipboardprint?
public List<List<A>> splitListAt(int i) {  
  return splitListAt(list(), this.reverse(), i).eval();  
}  
  
private TailCall<List<List<A>>> splitListAt(List<A> acc,  
                                            List<A> list, int i) {  
  return i == 0 || list.isEmpty()  
      ? ret(List.list(list.reverse(), acc))  
      : sus(() -> splitListAt(acc.cons(list.head()), list.tail(), i - 1));  
}  

This method will, of course, always return a list of two lists. Then you can define the divide method as follows:

view plaincopy to clipboardprint?
public List<List<A>> divide(int depth) {  
    return this.isEmpty() ? list(this) : divide(list(this), depth);  
}  
  
private List<List<A>> divide(List<List<A>> list, int depth) {  
    return list.head().length() < depth || depth < 2   
            ? list  
            : divide(list.flatMap(x -> x.splitListAt(x.length() / 2)), depth / 2);  
}  

Note that you don’t need to make this method stack-safe because the number of recursion steps will only be log(length). In other words, you’ll never have enough heap memory to hold a list long enough to cause a stack overflow.

Processing sublists in parallel
To process the sublists in parallel, you’ll need a special version of the method to execute, which will take as an additional parameter an ExecutorService configured with the number of threads you want to use in parallel.

Let's create a parFoldLeft method in List<A> that will take the same parameters as fold-Left plus an v and a function from B to B to B and that will return a Result<List<B>>. The additional function will be used to assemble the results from the sublists. Here’s the signature of the method (Exercise 8.23):

view plaincopy to clipboardprint?
public<B> Result<B> parFoldLeft(ExecutorService es, B identity, Function<B, Function<A, B>> f, Function<B, Function<B, B>> m)  

First, you must define the number of sublists you want to use and divide the list accordingly:

view plaincopy to clipboardprint?
final int chunks = 1024;  
final List<List<A>> dList = divide(chunks);  

Then, you’ll map the list of sublists with a function that will submit a task to the ExecutorService. This task consists of folding each sublist and returning a Future instance. The list of Future instances is mapped to a function calling get on each Future to produce a list of results (one for each sublist). Note that you must catch the potential exceptions.

Eventually, the list of results is folded with the second function, and the result is returned in a Result.Success. In the case of an exception, a Failure is returned.

view plaincopy to clipboardprint?
try {  
  List<B> result = dList.map(x -> es.submit(() -> x.foldLeft(identity,  
                                                         f))).map(x -> {  
    try {  
      return x.get();  
    } catch (InterruptedException | ExecutionException e) {  
      throw new RuntimeException(e);  
    }  
  });  
  return Result.success(result.foldLeft(identity, m));  
} catch (Exception e) {  
  return Result.failure(e);  
}  

You’ll find an example benchmark of this method in the accompanying code (https://github.com/fpinjava/fpinjava). The benchmark consists of computing 10 times the Fibonacci value of 35,000 random numbers between 1 and 30 with a very slow algorithm. On a four-core Macintosh, the parallel version executes in 22 seconds, whereas the serial version needs 83 seconds.

Although mapping can be implemented through a fold (and thus can benefit from automatic parallelization), it can also be implemented in parallel without using a fold. This is probably the simplest automatic parallelization that can be implemented on a list. Create a parMap method that will automatically apply a given function to all elements of a list in parallel. Here’s the method signature (Exercise 8.24):

view plaincopy to clipboardprint?
public <B> Result<List<B>> parMap(ExecutorService es, Function<A, B> g)  

Here’s the solution:

view plaincopy to clipboardprint?
public <B> Result<List<B>> parMap(ExecutorService es, Function<A, B> g) {  
  try {  
    return Result.success(this.map(x -> es.submit(() -> g.apply(x)))  
                                                             .map(x -> {  
      try {  
        return x.get();  
      } catch (InterruptedException | ExecutionException e) {  
        throw new RuntimeException(e);  
      }  
    }));  
  } catch (Exception e) {  
    return Result.failure(e);  
  }  
}  

The benchmark available in the code accompanying this book will allow you to measure the increase in performance. This increase may, of course, vary depending on the machine running the program.

Supplement
* Ch8 - Advanced list handling - Part1
* Ch8 - Advanced list handling - Part2
* Java 文章收集 - ExecutorService usage tutorial

Source From Here
Question
How would one create an iterative function (or iterator object) in python?

How-To
Iterator objects in python conform to the iterator protocol, which basically means they provide two methods: __iter__() and next(). The __iter__ returns the iterator object and is implicitly called at the start of loops. The next() method returns the next value and is implicitly called at each loop increment. next() raises a StopIteration exception when there are no more value to return, which is implicitly captured by looping constructs to stop iterating.

Here's a simple example of a counter:

view plaincopy to clipboardprint?
class Counter:  
    def __init__(self, low, high):  
        self.current = low  
        self.high = high  
  
    def __iter__(self):  
        return self  
  
    def next(self): # Python 3: def __next__(self)  
        if self.current > self.high:  
            raise StopIteration  
        else:  
            self.current += 1  
            return self.current - 1  
  
  
for c in Counter(3, 8):  
    print c  

This will print:

3
4
5
6
7
8

This is easier to write using a generator, as covered in a previous answer:

view plaincopy to clipboardprint?
def counter(low, high):  
    current = low  
    while current <= high:  
        yield current  
        current += 1  
  
for c in counter(3, 8):  
    print c  

The printed output will be the same. Under the hood, the generator object supports the iterator protocol and does something roughly similar to the class Counter.

David Mertz's article, Iterators and Simple Generators, is a pretty good introduction.

Supplement
* FAQ - What makes something iterable in python
* FAQ - In Python, how do I determine if an object is iterable?

程式扎記

標籤

2018年2月25日星期日

[ FP with Java ] Ch8 - Advanced list handling - Part2

[ Python 常見問題 ] Build a Basic Python Iterator

[Git 常見問題] error: The following untracked working tree files would be overwritten by merge

檢舉濫用情形

學習筆記

標籤

2018年2月25日 星期日