What is the best Python implementation in MapReduce mode?

What is the best Python implementation of MapReduce, a framework or library, may be as good as Apache hadoop, but if it is only in Python, and the best in terms of good documentation and ease of understanding Good, fully implement the MapReduce mode, high scalability, high stability, and light weight.

I searched for something called mincemeat on Google, not sure about it, but anyone else knows NS?

Thank you

If you search for them, there are some sections here and there. For example, Octopy and Disco and Hadoopy.

However, I don’t think any of them can compete with Hadoop in terms of maturity, stability, scalability, performance, etc. For small cases, They should be enough, but for something more “honorable”, you have to stick to Hadoop.

Remember, you can still use python/jython to write map/reduce programs in Hadoop.

Edit: I recently came across mrjob. This looks great because it simplifies the way to write map/reduce programs and then launch them on Hadoop or Amazon’s Elastic MapReduce platform. The article that passed this good news is here< /p>

What is the best Python implementation of MapReduce, a framework or library, may be as good as Apache hadoop, but if it is only in Python, and in good documentation and easy The best understanding, fully implements the MapReduce mode, high scalability, high stability, and light weight.

I searched for one called mincemeat on Google, not sure about it, but anything else Does anyone know?

Thank you

If you search for them, there are some parts here and there. For example Octopy and Disco and Hadoopy.

However, I don’t think any of them can compete with Hadoop in terms of maturity, stability, scalability, performance, etc. For small cases, they should be enough, but for something more “honorable” , You must stick to Hadoop.

Please remember that you can still use python/jython to write map/reduce programs in Hadoop.

Edit: I recently encountered mrjob. This It looks great because it simplifies the way to write map/reduce programs and then launch them on Hadoop or Amazon’s Elastic MapReduce platform. The article that passed this good news is here

p>

Leave a Comment

Your email address will not be published.