Refuel

Software Development

San Francisco, CA 1,104 followers

Clean, labeled data at the speed of thought

About us

Generate, annotate, clean and enrich datasets for all your AI needs with Refuel's LLM-powered platform. Simply instruct Refuel on the datasets you need, and let LLMs do the work of creating and labeling data.

Website: https://www.refuel.ai/
Industry: Software Development
Company size: 2-10 employees
Headquarters: San Francisco, CA
Type: Privately Held

Locations

Primary

San Francisco, CA, US

Updates

Refuel reposted this

Rishabh Bhargava

Co-Founder and CEO at Refuel.ai | ex-Stanford, Cloudera, Primer.ai
1w Edited
We just benchmarked OpenAI's newest model, GPT-4o mini. Here’s what we learned:

1. GPT-4o mini looks to be OpenAI’s replacement for GPT-3.5-turbo. Among our customers, very few were using GPT-3.5-turbo; they opted for Claude Haiku instead. However, GPT-4o mini appears to be smarter AND cheaper than Haiku, which is a big deal when competing for the simpler LLM workloads.
2. Large frontier models (e.g., GPT-4-turbo, Claude Opus, Claude 3.5 Sonnet) are excellent at complex reasoning, but slow and expensive. Meanwhile, smaller models are a faster and cheaper option for simpler tasks: extraction, simple summarization, Q&A, etc.
3. Given the huge cost reduction, it’s hard to imagine that this will make much money (if any) for OpenAI. Could this be a loss leader for OpenAI to make it harder for enterprise leaders to justify open source approaches?

Either way, the win for LLM consumers is that we’re at the start of a race to the bottom for the cost of intelligence. This is a great time to build with LLMs.
6 Comments

Refuel reposted this

Rishabh Bhargava

Co-Founder and CEO at Refuel.ai | ex-Stanford, Cloudera, Primer.ai
2w
AMD’s $665M acquisition of Silo AI is a bigger deal than most think, and strategically positions them to take on the 800-pound gorilla in the room: NVIDIA.

NVIDIA has long followed an approach of building models, frameworks, benchmarks and other products to showcase how enterprises can leverage NVIDIA as their hardware provider for AI workloads. This is evidenced by NVIDIA’s Omniverse, NIM, and OptiX products, all of which have helped them get a leg up in the AI arms race.

AMD’s acquisition of Silo AI now allows them to play the same ballgame. By making it easier to build solutions on top of their hardware (commoditizing their complements), AMD can compete with NVIDIA's strategy while generating additional demand for their hardware. Moreover, buying Silo AI allows AMD to quickly acquire AI talent that's familiar with the AMD stack (Silo AI runs LLMs on an AMD-based cluster).

What’s your prediction for AMD 24 months from now?
9 Comments

Refuel reposted this

Rishabh Bhargava

Co-Founder and CEO at Refuel.ai | ex-Stanford, Cloudera, Primer.ai
3w Edited
LLM benchmarks can be misleading. So much so that Anthropic and OpenAI are investing millions to try to address this challenge. The natural instinct is to pick the model with the highest eval % and call it a day, right? Not exactly.

1. Public benchmarks use datasets that are not reflective of common usage by consumers or enterprise users. Check out MMLU and HellaSwag for yourself.
2. Moreover, a recent study from Surge AI found that a third of these datasets contain typos and “nonsensical” writing.
3. Additionally, there’s no way to tell whether the LLM is actually reasoning or merely regurgitating answers it saw during training, a problem known as benchmark contamination.

The bottom line: no benchmark is going to be reflective of YOUR data. For you to trust AI models on your data and tasks, you’ll have to create your own evaluation datasets.

The value of benchmarks increases the more specific they are. For example, Anthropic just announced an initiative to fund the development of new types of benchmarks (cyber attacks, manipulation, deception, etc.). In Refuel's case, we’ve developed use-case-specific and industry-specific benchmarks, such as in financial services and retail (pictured below). We’ve worked with our customers to carefully construct benchmarks, with significant involvement from domain experts, that are as close to real-world performance as possible.

Is your business using the right evals?
5 Comments

Refuel reposted this

Rishabh Bhargava

Co-Founder and CEO at Refuel.ai | ex-Stanford, Cloudera, Primer.ai
4w Edited
“[quality data] is the biggest inhibitor for companies that have already invested now in LLMs, architecture and people”. This was the quote that stood out the most in CB Insights’ “Enterprise AI Report”, released last week. A few interesting insights and takeaways:

🚀 1. The performance gap between open source and closed source is closing fast: Meta’s open-source Llama-3-70B recently outperformed Anthropic’s Claude-3-Sonnet according to the MMLU benchmark (although Claude-3.5-Sonnet is back to being stronger than the Llama models). As business leaders grapple with financial constraints, they will have to find the sweet spot between performance, cost, and flexibility while considering the ROI of open source models.

⭐️ 2. Bigger isn’t always better: Smaller language models (SLMs) built for specific use cases are not only often faster and cheaper, but can also outperform LLMs. For example, Microsoft Phi-3 with 7B parameters outperformed ChatGPT 3.5 (reportedly 20B parameters), as measured by MMLU. And of course, Refuel-LLM-2, our purpose-built model, outperforms GPT-4-Turbo on data labeling, cleaning and enrichment benchmarks. Domain-specific models are not an opportunity enterprise buyers should shy away from, and should be explored for task-specific applications.

📈 3. Proprietary and clean data are everything: Clean data minimizes downstream AI effects and proprietary data drives differentiated business outcomes. As the quote below aptly alludes to, curating quality data and developing the supporting infrastructure will become the lifeblood of product development and the determinant of success in the era of Gen AI.

We’re lucky to see this in action every day with our customers and partners. A good data strategy, the curiosity and bravery to try task-specific models, and a focus on ROI: these are the ingredients to success with AI today.

Which takeaway stood out to you the most?
1 Comment

Refuel reposted this

Rishabh Bhargava

Co-Founder and CEO at Refuel.ai | ex-Stanford, Cloudera, Primer.ai
1mo Edited
OpenAI just announced the acquisition of Rockset. Super interesting move: OpenAI already has a few products where retrieval is important (ChatGPT and GPT-builder). A few interesting implications/questions:

1. What does this mean for companies focused on RAG tooling if some of these capabilities are going to be housed much closer to the model layer?
2. Which use cases are going to demand custom retrieval approaches that a one-size-fits-all offering from OpenAI won’t satisfy?
3. OpenAI gets a host of new enterprise sources from which they can pull data and make ChatGPT even more effective for them.
4. What is the next suite of products that we might see from OpenAI that build on retrieval capabilities, and not simply the improvement of the model layer?

Also interesting to see this non-model-related announcement come through in the same week as Anthropic’s Claude 3.5 Sonnet, which looks like a very strong contender.
11 Comments

Refuel reposted this

Databricks Mosaic Research

26,753 followers
2mo
The team at Refuel just released their latest #LLM for data labeling, enrichment and cleaning, trained on our Databricks Mosaic AI Training infrastructure! Get the details here: https://lnkd.in/gVcarizZ

Announcing Refuel LLM-2

refuel.ai

Refuel reposted this

Nihit Desai

Co-founder & CTO at Refuel.ai
2mo
Better data = Better AI. In this episode of Software Engineering Daily I dive into why this is true, what makes it hard and how we're solving this at scale at Refuel. Thank you Sean Falconer for hosting and having me on! 🚀

Software Engineering Daily

2,605 followers
2mo

Nihit Desai of Refuel joins the show with Sean Falconer to talk about the platform, and how to manage data in the current AI era. Listen here: https://lnkd.in/gdDsaa75

Using LLMs for Training Data Preparation with Nihit Desai - Software Engineering Daily

softwareengineeringdaily.com

Refuel

1,104 followers
5mo
🚀 TeachFX + Refuel: Leveraging Custom LLMs to Enhance Classroom Interactions

🎓 92% agreement with human experts in a complex domain
⏱ Reduced AI feature development time from 2 months to 2 weeks

📚 TeachFX, an ed-tech company focused on elevating classroom dialogue, teamed up with Refuel to revolutionize their product with new AI capabilities, enabling the detection of pivotal educational moments in classroom sessions.

✅ Leveraging Refuel's platform to create training datasets, TeachFX achieved 92% agreement with expert annotators on a complex, domain-specific task.

⚡ This shortened the feature development process from two months to just two weeks, dramatically accelerating TeachFX’s product roadmap.

💡 This partnership not only exemplifies the power of custom LLMs in enhancing data labeling efficiency and output quality, but also marks a significant stride towards improving educational outcomes.

👉 If you're interested in learning how custom LLMs are changing the game with respect to data quality, check out the full case study in the comments below. For more insights into leveraging AI for educational excellence, follow TeachFX and Refuel on LinkedIn or sign up for a Refuel demo here: https://lnkd.in/gtKqbXix.
1 Comment

Refuel

1,104 followers
6mo
🚀 Retail AI success story for Beni + Refuel: Data normalization with LLMs for product catalog data

🎯 2x Accuracy Improvement
💨 < 1 day of Engineering Effort
📈 245% Increase in GMV for a major partner

👕👖 With a massive catalog of over 200 million items, Beni faced a daunting task: improving the accuracy of their product size attribute from 46% to over 80%. The solution? A partnership with Refuel and our custom LLMs.

🚀 In just one day of effort (compared to the weeks or months such a task would typically require), Beni improved the accuracy to an astounding 87%. This led to a 99% reduction in data quality issues for reseller partners and a 245% increase in Gross Merchandise Value for a major partner.

💡 If you're intrigued by how AI can improve product catalog data and drive significant business impact for marketplaces, check out this case study (click on the comments for the full story).

👉 For more stories like this, follow Refuel on LinkedIn or sign up for Refuel here: https://lnkd.in/gtKqbXix.

#datascience #machinelearning #aiinnovation #retailtech #customervalue #speed #revenueboost #ai #ml #llms

1 Comment

Refuel

1,104 followers
12mo
Labeling with Confidence: Confidence estimation is an effective tool for mitigating hallucinations when leveraging LLMs for data labeling and enrichment. If we are able to estimate the model’s inherent confidence in its response, we can automatically reject low-confidence labels, and chain or ensemble LLMs. Excited to share a bit more about what we've been exploring and building at Refuel in this direction: https://lnkd.in/gyg54vfZ. You can access all of these features in Autolabel (https://lnkd.in/g7dX8Awi) with a one-line config change to your labeling task!
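
To make the idea in the post above concrete, here is a minimal Python sketch of rejecting low-confidence labels and combining several models' answers. It is not Autolabel's API: `LabeledItem`, `split_by_confidence`, `ensemble_label`, and the 0.8 threshold are hypothetical names and values chosen for illustration, and the confidence scores are assumed to already be available as numbers in [0, 1].

```python
from collections import defaultdict
from dataclasses import dataclass


@dataclass
class LabeledItem:
    """One model-produced label with an estimated confidence (assumed in [0, 1])."""
    text: str
    label: str
    confidence: float


def split_by_confidence(items, threshold=0.8):
    """Keep labels at or above the threshold; set the rest aside for review."""
    accepted, rejected = [], []
    for item in items:
        (accepted if item.confidence >= threshold else rejected).append(item)
    return accepted, rejected


def ensemble_label(candidates):
    """Confidence-weighted vote over (label, confidence) pairs from several models.

    Returns the winning label and its share of the total confidence mass.
    """
    if not candidates:
        raise ValueError("need at least one (label, confidence) pair")
    scores = defaultdict(float)
    for label, confidence in candidates:
        scores[label] += confidence
    total = sum(scores.values())
    best = max(scores, key=scores.get)
    share = scores[best] / total if total > 0 else 0.0
    return best, share


if __name__ == "__main__":
    items = [
        LabeledItem("great product", "positive", 0.95),
        LabeledItem("it arrived", "negative", 0.40),
    ]
    accepted, rejected = split_by_confidence(items)
    print(len(accepted), "accepted;", len(rejected), "sent for review")
    print(ensemble_label([("positive", 0.9), ("negative", 0.6), ("positive", 0.5)]))
```

Rejected items could be routed to a human reviewer or to a stronger model, which is one simple way to chain LLMs; the threshold would need tuning against a labeled validation set.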

Labeling with Confidence

refuel.ai


Funding

Total rounds: 2
Last round: Seed, Jul 15, 2023
Amount: US$ 5.2M
Investors: General Catalyst, XYZ Venture Capital

Employees at Refuel

Derek D.

James M Lopez

Visionary Entrepreneur

James M Lopez

ReFuel Corp CEO

Nihit Desai

Co-founder & CTO at Refuel.ai

Similar pages

Uplimit

Primer.ai

Essential AI

First Round Fast Track

Beni

MLOps Roundup

On Deck

Portrait Analytics

Spiritus

Cloudera
