News
Math Behind Machine Learning Gets 10-100X Faster with Anaconda Inside Hadoop
AUSTIN, TX—February 29, 2016—Continuum Analytics, the creator and driving force behind Anaconda, the leading modern open source analytics platform powered by Python, today announced advancements in Anaconda bringing high performance advanced analytics to Hadoop. In addition to providing Python and R packages alongside Hadoop clusters, Anaconda will include a distributed processing framework for Hadoop that interacts directly with HDFS and YARN. With Anaconda’s new capabilities inside Hadoop, data scientists can finally achieve lightning fast processing of computationally intensive machine learning analytics to realize the full value of their Big Data.
Anaconda, already known for high performance Python, is now delivering high performance for Hadoop. Anaconda is leading the Open Data Science movement opening up Hadoop through a Python gateway that interacts directly with YARN and HDFS. This allows all Anaconda functionality, including other high performance analytics based on R and MPI, to work with the Hadoop ecosystem. It also bridges the gap between High Performance Computing (HPC) and Big Data to help enterprises unlock the value of the data tied up in their Hadoop cluster.
“There’s so much potential in Hadoop, yet enterprise customers still struggle to unlock all of the computing power in their clusters even with the latest execution engines like Spark. Enterprise customers demand flexibility, high performance and efficient use of memory to scale up their Big Data workloads, especially for heavy duty machine learning. Continuum Analytics is helping enterprises extract value from Big Data,” said Peter Wang, co-founder and CTO of Continuum Analytics. “Anaconda empowers enterprises to get high performance and interactive analytics not only by leveraging the Open Data Science ecosystem--including Python and R--but also by leveraging investments in HPC using MPI. This is breakthrough technology for organizations who want to get high value and high impact data science solutions from Hadoop.”
Anaconda provides high performance computing professionals with a bridge into the Big Data and Hadoop ecosystem using the integrating power of Python. Many HPC professionals have resisted the move to Hadoop because of the lack of proven high performance analytics available on Hadoop. Now these seasoned professionals can leverage their powerful legacy MPI advanced analytics against Hadoop data stores. Similarly, by leveraging the same HDFS and YARN bridge, the vast R community will be able to realize performance gains using parallel R analytics in Hadoop. For enterprises concerned with authentication to their Hadoop data, Anaconda will also include single-sign-on (SSO) via Kerberos.
Continuum Analytics will be demonstrating its latest innovation for the Anaconda platform at Booth #1336 in the Strata & Hadoop World San Jose exhibition hall. If you are a customer, prospect, analyst or journalist and interested in meeting with Continuum Analytics at the event, please reach out to continuum@treblepr.com.
The new capabilities in Anaconda will be generally available by April of 2016.
About Continuum Analytics
Continuum Analytics is the creator and driving force behind Anaconda, the leading, modern open source analytics platform powered by Python. We put superpowers into the hands of people who are changing the world.
With more than 2.25M downloads annually and growing, Anaconda is trusted by the world’s leading businesses across industries – financial services, government, health & life sciences, technology, retail & CPG, oil & gas – to solve the world’s most challenging problems. Anaconda does this by helping everyone in the data science team discover, analyze, and collaborate by connecting their curiosity and experience with data. With Anaconda, teams manage their open data science environments without any hassles to harness the power of the latest open source analytic and technology innovations.
Our community loves Anaconda because it empowers the entire data science team – data scientists, developers, DevOps, data engineers, and business analysts – to connect the dots in their data and accelerate the time-to-value that is required in today’s world. To ensure our customers are successful, we offer comprehensive support, training and professional services.
Continuum Analytics' founders and developers have created or contribute to some of the most popular open data science technologies, including NumPy, SciPy, Matplotlib, pandas, Jupyter/IPython, Bokeh, Numba and many others. Continuum Analytics is venture-backed by General Catalyst and BuildGroup.
To learn more about Continuum Analytics, visit www.continuum.io.
###