Remember me

Register  |   Lost password?


 

Recent members

 

 

Show:
The Practical Quant wrote a new blog post titled Labeling, transforming, and structuring training data sets for machine learning
The O'Reilly Data Show Podcast: Alex Ratner on how to build and manage training data with Snorkel.In this episode of the Data Show, I speak with Alex Ratner, project lead for Stanford’s Snorkel open source project; Ratner also recently garnered a faculty position at the University of Washington and is currently working on a company supporting and extending the Snorkel project. Snorkel is a framework for building and managing training data. Based on our survey from earlier this year, labeled data remains a key bottleneck for organizations building machine learning applications and...
10 days ago
The Practical Quant wrote a new blog post titled Got speech? These guidelines will help you get started building voice applications
Speech adds another level of complexity to AI applications - today’s voice applications provide a very early glimpse of what is to come.By Ben Lorica and Yishay Carmiel.As companies begin to explore AI technologies, three areas in particular are garnering a lot of attention: computer vision, natural language applications, and speech technologies. A recent report from the World Intellectual Patent Office (WIPO) found that together these three areas accounted for a majority of patents related to AI: computer vision (49% of all patents), natural language processing (NLP) (14%), and speech...
17 days ago
The Practical Quant wrote a new blog post titled Make data science more useful
The O'Reilly Data Show Podcast: Cassie Kozyrkov on connecting data and AI to business.In this episode of the Data Show, I speak with Cassie Kozyrkov, technical director and chief decision scientist at Google Cloud. She describes "decision intelligence" as an interdisciplinary field concerned with all aspects of decision-making, and which combines data science with the behavioral sciences. Most recently she has been focused on developing best practices that can help practitioners make safe, effective use of AI and data. Kozyrkov uses her platform to help data scientists develop skills that...
24 days ago
The Practical Quant wrote a new blog post titled One simple graphic: Researchers love PyTorch and TensorFlow
Interest in PyTorch among researchers is growing rapidly.In a recent survey—AI Adoption in the Enterprise, which drew more than 1,300 respondents—we found significant usage of several machine learning (ML) libraries and frameworks. About half indicated they used TensorFlow or scikit-learn, and a third reported they were using PyTorch or Keras.I recently attended an interesting RISELab presentation delivered by Caroline Lemieux describing recent work on AutoPandas and automation tools that rely on program synthesis. In the course of her presentation, Lemieux reviewed usage statistics they had...
31 days ago
The Practical Quant wrote a new blog post titled Acquiring and sharing high-quality data
The O'Reilly Data Show Podcast: Roger Chen on the fair value and decentralized governance of data.In this episode of the Data Show, I spoke with Roger Chen, co-founder and CEO of Computable Labs, a startup focused on building tools for the creation of data networks and data exchanges. Chen has also served as co-chair of O'Reilly's Artificial Intelligence Conference since its inception in 2016. This conversation took place the day after Chen and his collaborators released an interesting new white paper, Fair value and decentralized governance of data. Current-generation AI and...
38 days ago
The Practical Quant wrote a new blog post titled You'll want Nexar's newly released Live Map for your city
Extracting and exposing valuable insights to enable smart cities and many other applications.I recently had the privilege of getting a preview of Nexar's Live Map, from my friend, Nexar's CTO and co-founder Bruno Fernandez-Ruiz. Nexar uses off-the-shelf smartphones and dash-cams, sophisticated data ingestion, data processing, sensor fusion, and machine learning software to realize their vision of creating the largest safe driving network. To date the company has recorded many miles of driving video ("100 million miles, and more than 10 million miles every month"). This means the company now...
39 days ago
The Practical Quant wrote a new blog post titled Managing machine learning in the enterprise: Lessons from banking and health care
A look at how guidelines from regulated industries can help shape your ML strategy.By Ben Lorica, Harish Doddi, David Talby.As companies use machine learning (ML) and AI technologies across a broader suite of products and services, it’s clear that new tools, best practices, and new organizational structures will be needed. In recent posts, we described requisite foundational technologies needed to sustain machine learning practices within organizations, and specialized tools for model development, model governance, and model operations/testing/monitoring.What cultural and organizational...
41 days ago
The Practical Quant wrote a new blog post titled Tools for machine learning development
The O'Reilly Data Show: Ben Lorica chats with Jeff Meyerson of Software Engineering Daily about data engineering, data architecture and infrastructure, and machine learning.By Jenn Webb.In this week's episode of the Data Show, we're featuring an interview Data Show host Ben Lorica participated in for the Software Engineering Daily Podcast, where he was interviewed by Jeff Meyerson. Their conversation mainly centered around data engineering, data architecture and infrastructure, and machine learning (ML).Here are a few highlights:Tools for productive collaborationA data catalog, at a high...
53 days ago
The Practical Quant wrote a new blog post titled RISELab’s AutoPandas hints at automation tech that will change the nature of software development
Neural-backed generators are a promising step toward practical program synthesis.There's a lot of hype surrounding AI, but are companies actually beginning to use AI technologies? In a survey we released earlier this year, we found that more than 60% of respondents worked in organizations that planned to invest some of their IT budgets into AI. We also found that the level of investment depended on how much experience a company already had with AI technologies, with companies further along the maturity curve planning substantially higher investments. As far as current levels of adoption, the...
55 days ago
The Practical Quant wrote a new blog post titled One simple chart: Who is interested in Spark NLP?
As we close in on its two-year anniversary, Spark NLP is proving itself a viable option for enterprise use.In July 2016, I broached the idea for an NLP library aimed at Apache Spark users to my friend David Talby. A little over a year later, Talby and his collaborators announced the release of Spark NLP. They described the motivation behind the project in their announcement post and in this accompanying podcast that Talby and I wrote, as well as in this recent post comparing popular open source NLP libraries. [Full disclosure: I’m an advisor to Databricks, the startup founded by the team that...
59 days ago
The Practical Quant wrote a new blog post titled AI and machine learning will require retraining your entire organization
To successfully integrate AI and machine learning technologies, companies need to take a more holistic approach toward training their workforce.In our recent surveys AI Adoption in the Enterprise and Machine Learning Adoption in the Enterprise, we found growing interest in AI technologies among companies across a variety of industries and geographic locations. Our findings align with other surveys and studies—in fact, a recent study by the World Intellectual Patent Office (WIPO) found that the surge in research in AI and machine learning (ML) has been accompanied by an even stronger growth in...
60 days ago
The Practical Quant wrote a new blog post titled What are model governance and model operations?
A look at the landscape of tools for building and deploying robust, production-ready machine learning models.Our surveys over the past couple of years have shown growing interest in machine learning (ML) among organizations from diverse industries. A few factors are contributing to this strong interest in implementing ML in products and services. First, the machine learning community has conducted groundbreaking research in many areas of interest to companies, and much of this research has been conducted out in the open via preprints and conference presentations. We are also beginning to see...
62 days ago
The Practical Quant wrote a new blog post titled Enabling end-to-end machine learning pipelines in real-world applications
The O'Reilly Data Show Podcast: Nick Pentreath on overcoming challenges in productionizing machine learning models.In this episode of the Data Show, I spoke with Nick Pentreath, principal engineer at IBM. Pentreath was an early and avid user of Apache Spark, and he subsequently became a Spark committer and PMC member. Most recently his focus has been on machine learning, particularly deep learning, and he is part of a group within IBM focused on building open source tools that enable end-to-end machine learning pipelines.We had a great conversation spanning many topics, including:AI Fairness...
65 days ago
The Practical Quant wrote a new blog post titled The quest for high-quality data
[A version of this post appears on the O'Reilly Radar.]Machine learning solutions for data integration, cleaning, and data generation are beginning to emerge.By Ihab Ilyas and Ben Lorica.“AI starts with ‘good’ data” is a statement that receives wide agreement from data scientists, analysts, and business owners. There has been a significant increase in our ability to build complex AI models for predictions, classifications, and various analytics tasks, and there’s an abundance of (fairly easy-to-use) tools that allow data scientists and analysts to provision complex models within days. As...
67 days ago
The Practical Quant wrote a new blog post titled AI adoption is being fueled by an improved tool ecosystem
[A version of this post appears on the O'Reilly Radar.]We now are in the implementation phase for AI technologies.In this post, I share slides and notes from a keynote that Roger Chen and I gave at the 2019 Artificial Intelligence conference in New York City. In this short summary, I highlight results from a — survey (AI Adoption in the Enterprise) and describe recent trends in AI. Over the past decade, AI and machine learning (ML) have become extremely active research areas: the web site arxiv.org had an average daily upload of around 100 machine learning papers in 2018. With all the...
75 days ago
The Practical Quant wrote a new blog post titled Bringing scalable real-time analytics to the enterprise
[A version of this post appears on the O'Reilly Radar.]The O'Reilly Data Show Podcast: Dhruba Borthakur and Shruti Bhat on enabling interactive analytics and data applications against live data.In this episode of the Data Show, I spoke with Dhruba Borthakur (co-founder and CTO) and Shruti Bhat (SVP of Marketing) of Rockset, a startup focused on building solutions for interactive data science and live applications. Borthakur was the founding engineer of HDFS and creator of RocksDB, while Bhat is an experienced product and marketing executive focused on enterprise software and data products....
80 days ago
The Practical Quant wrote a new blog post titled Applications of data science and machine learning in financial services
[A version of this post appears on the O'Reilly Radar.]The O'Reilly Data Show Podcast: Jike Chong on the many exciting opportunities for data professionals in the U.S. and China.In this episode of the Data Show, I spoke with Jike Chong, chief data scientist at Acorns, a startup focused on building tools for micro-investing. Chong has extensive experience using analytics and machine learning in financial services, and he has experience building data science teams in the U.S. and in China.We had a great conversation spanning many topics, including:Potential applications of data science in...
94 days ago
The Practical Quant wrote a new blog post titled Becoming a machine learning company means investing in foundational technologies
[A version of this post appears on the O'Reilly Radar.]Companies successfully adopt machine learning either by building on existing data products and services, or by modernizing existing models and algorithms.In this post, I share slides and notes from a keynote I gave at the Strata Data Conference in London earlier this year. I will highlight the results of a recent survey on machine learning adoption, and along the way describe recent trends in data and machine learning (ML) within companies. This is a good time to assess enterprise activities, as there are many indications a number of...
96 days ago
The Practical Quant wrote a new blog post titled How AI and machine learning are improving customer experience
[A version of this post appears on the O'Reilly Radar.]From data quality to personalization, to customer acquisition and retention, and beyond, AI and ML will shape the customer experience of the future.By Ben Lorica and Mike Loukides.What can artificial intelligence (AI) and machine learning (ML) do to improve customer experience? AI and ML already have been intimately involved in online shopping since, well, the beginning of online shopping. You can’t use Amazon or any other shopping service without getting recommendations, which are often personalized based on the vendor’s understanding of...
103 days ago
The Practical Quant wrote a new blog post titled Real-time entity resolution made accessible
[A version of this post appears on the O'Reilly Radar.]The O'Reilly Data Show Podcast: Jeff Jonas on the evolution of entity resolution technologies.In this episode of the Data Show, I spoke with Jeff Jonas, CEO, founder and chief scientist of Senzing, a startup focused on making real-time entity resolution technologies broadly accessible. He was previously a fellow and chief scientist of context computing at IBM. Entity resolution (ER) refers to techniques and tools for identifying and linking manifestations of the same entity/object/individual. Ironically, ER itself has many different names...
108 days ago