The NVIDIA RAPIDS Accelerator for Apache Spark software plug-in pioneered a zero code change user experience (UX) for GPU-accelerated data processing. It accelerates existing Apache Spark SQL and DataFrame-based applications on NVIDIA GPUs by over 9x without requiring a change to your queries or source code. This led to the new Spark RAPIDS ML Python library, which can speed up��
]]>Spark RAPIDS ML is an open-source Python package enabling NVIDIA GPU acceleration of PySpark MLlib. It offers PySpark MLlib DataFrame API compatibility and speedups when training with the supported algorithms. See New GPU Library Lowers Compute Costs for Apache Spark ML for more details. PySpark MLlib DataFrame API compatibility means easier incorporation into existing PySpark ML applications��
]]>Apache Spark is an industry-leading platform for distributed extract, transform, and load (ETL) workloads on large-scale data. However, with the advent of deep learning (DL), many Spark practitioners have sought to add DL models to their data processing pipelines across a variety of use cases like sales predictions, content recommendations, sentiment analysis, and fraud detection. Yet��
]]>When you see a context-relevant advertisement on a web page, it��s most likely content served by a Taboola data pipeline. As the leading content recommendation company in the world, a big challenge for Taboola was the frequent need to scale Apache Spark CPU cluster capacity to address the constantly growing compute and storage requirements. Data center capacity and hardware costs are always��
]]>XGBoost is a decision-tree�Cbased, ensemble machine learning algorithm based on gradient boosting. However, until recently, it didn��t natively support categorical data. Categorical features had to be manually encoded before they could be used for training or inference. In the case of ordinal categories, for example, school grades, this is often done using label encoding where each category is��
]]>Data is one of the most valuable assets that a business can possess. It sits at the core of data science and data analysis: without data, they��re both obsolete. Businesses that actively collect data may have a competitive advantage over those that do not. With sufficient data, organizations can better determine the cause of problems and make informed decisions. There are scenarios where an��
]]>Data is the lifeblood of modern enterprises, whether you��re a retailer, financial service company, or digital advertiser. Across industries, organizations are recognizing the importance of their data for business analytics, machine learning, and AI. Smart businesses are investing in new ways to extract value from their data: to better understand customer needs and behaviors��
]]>Any business or industry, from retail and healthcare to financial services, is subject to fraud. The cost of fraud can be staggering. Every $1 of fraud loss costs financial firms about $4 to mitigate. Online sellers will lose $130B to online payment fraud between 2018 and 2023. By using AI and big data analytics, enterprises can efficiently prevent fraud attempts in real time.
]]>Data engineering and data science workflows are often limited by the ability of platforms to process massively growing amounts of data. The integration of the Cloudera Data Platform (CDP), the RAPIDS Accelerator for Apache Spark 3.0, and NVIDIA computing, announced April 12, 2021, enables accelerated and scalable big data pre-processing, and workflows without code changes. With Cloudera CDP and��
]]>Dask is an accessible and powerful solution for natively scaling Python analytics. Using familiar interfaces, it allows data scientists familiar with PyData tools to scale big data workloads easily. Dask is such a powerful tool that we have adopted it throughout a variety of projects at NVIDIA. When paired with RAPIDS, data practitioners can distribute big data workloads across massive NVIDIA GPU��
]]>The Italian multinational oil giant Eni deployed a 18.6 petaflops GPU-accelerated supercomputer, making it the most powerful industrial system in the world. Located outside Milan, the new HPC4 machine will scan for oil and gas reservoirs deep below the Earth. ��This is where the company��s heart is, where we hold our most delicate data and proprietary technology,�� Eni Chief Executive Officer��
]]>Vivek Venugopalan, a staff research scientist at the United Technologies Research Center (UTRC) shares how they are using deep learning and GPUs to understand the life of an aircraft engine and predictive maintenance for elevators in high-rise buildings. ��GPUs have helped us arrive at solutions quickly for computationally intensive challenges across all UTRC platforms, especially in this era of��
]]>Astronomers from around the world pointed their powerful telescopes towards a supermassive black hole that lies in the center of the Milky Way (nearly 26,000 light years from Earth) and believe they have snapped the first-ever picture of a black hole. It will take months to develop the image, but if the scientists succeed the results may help reveal the mysteries about what the universe is made��
]]>Instacart, an Internet-based grocery delivery service, shares how they are using deep learning to help their tens of thousands personal shoppers be more efficient. ��By observing how our shoppers have picked millions of customer orders through our app, we have built models that predict the sequences our fastest shoppers will follow,�� mentions Instacart VP of Data Science Jeremy Stanly in a blog��
]]>Iowa State University researchers are developing a deep learning-based system to help the Iowa Department of Transportation improve incident detection and support operator decision-making. ��There is more data than you could ever imagine coming out of this system,�� said Neal Hawkins, the associate director of Iowa State��s Institute for Transportation. ��We��re getting data every 20 seconds from all��
]]>A team of researchers from University of Alberta, Charles University in Prague and Czech Technical University developed an AI system called DeepStack that defeated professional poker players, making it the first computer program to beat professional players in heads-up no-limit Texas hold��em poker. ��Poker has been a longstanding challenge problem in artificial intelligence,�� says Michael Bowling��
]]>Researchers at the NYU Tandon School of Engineering are developing an artificial intelligence system for autonomous vehicles that links them to HERE Live Map cloud-based mapping system. ��Essentially, we want to be able to precisely match what the car sees with what��s in the cloud database. An incredibly precise ruler isn��t of much use if your vision is blurry,�� explained Yi Fang��
]]>Joel Dudley, an Associate Professor at the Icahn School of Medicine at Mount Sinai in New York shares how he is developing and applying advanced computational methods to integrate the digital universe of information to build better predictive models of diseases and drug response. Using TITAN X GPUs and deep learning, his lab is training their neural networks on electronic health records to��
]]>San Francisco-based fashion startup Stitch Fix is applying deep learning to match their customers with personalized clothing recommendations. Using TITAN X GPUs and Tesla K80 GPUs on the Amazon cloud, along with CUDA and cuDNN to train their deep learning models, Stich Fix��s natural language processing algorithms decode written answers from customers�� feedback on what they liked or disliked��
]]>Citi Ventures made a strategic investment in Feedzai which uses deep learning to provide real-time fraud prevention in ecommerce and banking. Using CUDA, GTX 1080 GPUs and cuDNN with TensorFlow to train their deep learning models, Feedzai��s platform is able to scan large amounts of data to recognize evolving threats and then alerts customers in real-time to protect against fraud.
]]>Big data visualization startup Zebra Medical Vision took home the inaugural award for their significant advancements of Artificial Intelligence and deep learning using NVIDIA GPUs. The world has an aging population and the demand for medical imaging services is quickly outpacing the supply of radiologists. Israel-based Zebra Medical Vision uses big data to deliver large-scale clinical research��
]]>A startup from California is using GPUs and big data to predict what homes are likely to buy solar panels. PowerScout is using GPUs on the Amazon cloud and cuDNN to train their deep learning models on a mix of data from commercial databases and LIDAR to detect solar panels from satellite images, and to also detect the presence of trees near homes that could cast shade onto roofs.
]]>Researchers from University of Massachusetts Amherst and Mount Holyoke College received a four-year grant from the National Science Foundation to analyze images and data on the chemical composition of rocks and dust from NASA��s Curiosity rover. The rover has been exploring a crater on Mars since 2012 and sends back large amounts of data collected using a process called laser-induced breakdown��
]]>Joshua Patterson, principal data scientist of Accenture Labs shares how his team is using NVIDIA GPUs and GPU-accelerated libraries to quickly detect security threats by analyzing anomalies in large-scale network graphs. ��When we can move 4 billion node graphs onto a GPU and have the shared memory of all the other GPUs and have that connected processing power�� it��s really going to cut-out months��
]]>Dr. Wojtek Goscinski, High Performance Computing Manager at the Monash University eResearch Center, shares how his team is using supercomputers accelerated by NVIDIA Tesla GPUs to process and analyze big data to create visualizations of large-scale molecular datasets. Dr. Goscinski mentions they are able to process their data within hours with GPUs as opposed to days and weeks. Now��
]]>Adam McLaughlin, PhD student at Georgia Tech shares how he is using NVIDIA Tesla GPUs for his research on Betweenness Centrality �C a graph analytics algorithm that tracks the most important vertices within a network. This can be applied to a broad range of applications, such as finding the head of a crime ring or determining the best location for a store within a city. Using a cluster of GPUs for��
]]>To accelerate biomedical research, Australia��s Monash University boosted its research infrastructure with a third GPU-accelerated supercomputer called MASSIVE-3. MASSIVE-3 is equipped with both Tesla GPUs and Quadro GPUs for data processing and visualization, driving the new system nearly four times faster than MASSIVE-2. Over the past five years, MASSIVE has played a key role in driving��
]]>Jeroen Tromp, Professor at Princeton University shares how his team is using the Tesla GPU-accelerated Titan Supercomputer at Oak Ridge National Laboratory to image the earth��s interior on a global scale. Tromp and his team are simulating seismic wave propagation by analyzing hundreds of earthquakes recorded by thousands of stations across the world to create 3D global tomographic maps.
]]>Rob High, IBM Fellow, VP and chief technology officer for Watson, will deliver a keynote at the GPU Technology Conference in Silicon Valley on April 6. Watson was an overnight sensation when it competed on Jeopardy! against two former winners to win $1 million. Watson is among the first of a new generation of cognitive systems with far-reaching applications. It uses artificial intelligence��
]]>For the first time, astronomers have tracked down the location of a fast radio burst (FRB), confirming these short but spectacular flashes of radio waves originate in the distant universe. Australian astronomers announced last week that they successfully used Tesla GPUs and CSIRO radio telescopes in eastern Australia and Japan��s Subaru telescope in Hawaii to trace the source of the burst to a��
]]>To reveal deeper insights into important activities taking place around the world, DigitalGlobe��s advanced satellite constellation collects nearly 4 million km2 of high-resolution earth imagery each day. The company announced they will now rely on NVIDIA GPUs and deep learning to automatically identify objects such as airplanes, vehicles, and gray elephants, as well as to detect patterns from��
]]>Ornithologists study every aspect of birds, including bird songs, flight patterns, physical appearance, and migration patterns �C and to do so, they use acoustic sensors and cameras placed in remote areas. Conservation Metrics, a California-based company, is using deep learning accelerated with NVIDIA GPUs to help capture the immense amounts of data that would be nearly impossible to analyze��
]]>We love seeing all of the social media posts from developers using NVIDIA GPUs �C here are a few highlights from the week: On Twitter? Follow @GPUComputing and @mention us and/or use hashtags so we��re able to keep track of what you��re up to: #CUDA, #cuDNN, #OpenACC.
]]>Alibaba Group��s cloud computing business, AliCloud, signed a new partnership with NVIDIA to collaborate on AliCloud HPC, the first GPU-accelerated cloud platform for high performance computing (HPC) in China. AliCloud will work with NVIDIA to broadly promote its cloud-based GPU offerings to its customers �� primarily fast-growing startups �C for AI and HPC work. ��Innovative companies in deep��
]]>Ruchir Puri, an IBM Fellow at IBM��s Thomas J. Watson Research Center shares how they are building large-scale big data systems and delivering real-time solutions, such as using machine learning and GPUs to predict drug reactions. Learn more about the work IBM Thomas J. Watson Research Center is doing with GPUs at research.ibm.com/labs/watson Share your GPU-accelerated science with��
]]>Computers can drive, create recipes, even compose rap songs. London audiences will soon find out whether they can write a hit musical, too. The new show, ��Beyond the Fence,�� is the world��s first musical conceived and substantially crafted by computer. Opening next month in the West End, it��s an experiment to determine how technology affects art and the creative process.
]]>Using the NVIDIA Tesla Accelerated Computing Platform in the Amazon Web Services cloud, this Southern California startup is supporting the City of Los Angeles with an initiative to eliminate traffic-related deaths. The company uses a combination of in-house data scientists, proprietary technology and custom data models to help businesses do things like predict a customer��s lifetime value and��
]]>Bryan Catanzaro, Senior Researcher at Baidu, shares how his company is Tesla and TITAN GPUs for a variety of Artificial Intelligence applications, including speech recognition, and natural language understanding. Watch Bryan��s talk in the NVIDIA GPU Technology Theater at SC15: Watch Now Learn more about their research at http://bit.ly/1NE1cPi Share your GPU-accelerated science��
]]>Instagram could offer a novel way of monitoring the drinking habits of teenagers. Using photos and text from Instagram, a team of researchers from the University of Rochester has shown that this data can not only expose patterns of underage drinking more cheaply and faster than conventional surveys, but also find new patterns, such as what alcohol brands or types are favored by different��
]]>For the past 30 years, users of the San Diego Supercomputer Center (SDSC) systems have achieved major scientific breakthroughs spanning many domains, from earth sciences and biology to astrophysics, bioinformatics, and health IT. A few milestones include: 1987: Scientists take a major step in the new arena of rational drug design, determining the relative free energies of binding for different��
]]>Stanford PhD student Andrej Karpathy trained a model overnight on a Tesla K40 to tell you how to take a better selfie photo. Convolutional Neural Networks are great at recognizing things, places and people in your personal photos, crops, traffic, various anomalies in medical images and all kinds of useful things. But once in a while these powerful visual recognition models can also be warped for��
]]>A Santa Cruz, California based company is using big data and deep learning to improve conservation efforts. California birdwatchers can go a lifetime without seeing the globally endangered marbled murelett bird and with now with remote acoustic sensors and deep learning, biologists are now able to analyze the audio of the bird, and keep better track the populations of species that were previously��
]]>To enable deep learning, Yahoo added GPU nodes into their Hadoop clusters with each node having 4 Nvidia Tesla K80 cards, each card with two GK210 GPUs. These nodes have 10x processing power than the traditional commodity CPU nodes they generally use in their Hadoop clusters. Yahoo has progressively invested in building and scaling Apache Hadoop clusters with a current footprint of more than��
]]>Large reservoirs are found at greater depths and in sediments that are much harder to analyze, like the recent Jack Field discovery in the Gulf of Mexico which was found at more than 20,000 feet under the sea floor. To interpret and discover these reservoirs it is necessary to acquire and process huge amounts of seismic data. Oil and gas companies use complex algorithms to process��
]]>MapD, an NVIDIA-funded startup, has created a new type of database to handle terabytes of data and impressive visualizations that relies on GPUs instead of the traditional AMD or Intel chips. Todd Mostak, Founder and CEO of MapD, decided to build a SQL database that relies on graphics processors when he was completing his thesis that analyzed a huge number of tweets. MapD was the winner of GPU��
]]>Training their algorithms with CUDA, researchers from Poland published a paper that presents a new image processing solution that will monitor pavement surfaces by using downward facing cameras placed on the rear of a vehicle. Cracks are the most requiring type of pavement distresses to detect and classify automatically. Due to its nature are easily absorbed by other types of pavement surface��
]]>Companies across nearly all industries are exploring how to use GPU-powered deep learning to extract insights from big data. From self-driving cars to disease-detecting mirrors, the use cases for deep learning is expanding by the day. Since computer scientist Geoff Hinton started using GPUs to train his neural networks, researchers are applying the technology to tough modeling problems in the real��
]]>Australia��s first Data Arena powered by by nine NVIDIA Quadro K6000, with 27,000 CUDA cores lets you literally see, hear and feel data sets through 3D visualization. A few software developers from the University of Technology Sydney built a 3D, 360-degree data visualization room to help researchers and data scientists intuitively explore huge and complex data in great detail.
]]>