search
HomeWeb Front-endJS TutorialSimplifying Data Extraction with OpenAI JSON Mode and JSON Schemas

Simplifying Data Extraction with OpenAI JSON Mode and JSON Schemas

When I first experimented with ChatGPT-3.5 after its release, I was thrilled about its potential for various applications. However, my excitement quickly faded when I encountered a major roadblock: although the valuable information it returned was exceptionally readable, it was not in a form that could be reliably ingested by an application. Ironically, LLMs excel in extracting information from unstructured text but can only return it in unstructured form. Trying to programmatically extract results from LLMs felt like being at an incredible restaurant that serves the most delicious food, but without any utensils — you can see it and smell it, but you just can’t get to it.

I tried every trick in the book to coax it into giving me some semblance of structured data. “Please, just separate each item with a bar or a new line and skip the commentary,” I’d plead. Sometimes it worked, sometimes it didn’t. Sometimes it would “helpfully” number or reorder the items, like a well-meaning but slightly confused assistant. Other times it would still sneak in some commentary, reminiscent of a chatty co-worker. I even demanded it in no uncertain terms to return JSON and nothing else, but it sometimes left out a comma — almost as if it were taking a passive-aggressive jab. Eventually, I gave up and reluctantly returned to the less exciting but more predictable confines of traditional algorithms.

Fortunately, a few months later, OpenAI introduced JSON mode, a feature that forces the LLM to return valid JSON. I decided to try this feature and found it significantly more effective for processing results in my applications. Here’s an example of the output with JSON mode enabled:

PROMPT:‍
Parse the following sentence into words and then return the results
as a list of the original word and the translation in English and
return the results in JSON.
-- sentence --
早安‍


RESULTS:
{
  "results": [
    {
      "original": "早安",
      "translation": "Good morning"
    }
  ]
}

This output is certainly an improvement. However, while the output is valid JSON, its structure can vary depending on the contents of the prompt. A more predictable approach is to specify the desired return format. One way to achieve this is by providing a sample JSON structure for the LLM to follow. This method involves creating an example and writing code to parse it. If the structure changes, modifications must be done in both places.

An alternative approach is to define a Data Transfer Object (DTO) to hold the results and use it both to instruct the LLM and to parse the results, avoiding synchronization issues. First, define the DTO, for example:‍

record Entries(List<entry> entries) {
    record Entry(String originalWord, String wordInEnglish, String pronunciation) {}
}
</entry>

Now the DTO can be used in the prompt instructions as well as by the parsing code:

// Construct the prompt with the output schema.
var prompt = MessageFormat.format("""
   Parse the following sentence into English and return the results
   in JSON according to the following JSON schema.

   人工智慧將引領未來,以智慧之光照亮人類無限可能的前程。
   --- output json schema ---
   {0}
   """, jsonSchemaOf(Entries.class));
   var result = sendPrompt(prompt, Entries.class);

Here’s the code that uses the Jackson JSON Schema generator:

PROMPT:‍
Parse the following sentence into words and then return the results
as a list of the original word and the translation in English and
return the results in JSON.
-- sentence --
早安‍


RESULTS:
{
  "results": [
    {
      "original": "早安",
      "translation": "Good morning"
    }
  ]
}

Note: By default, the generated schema will include ID fields used for references, which can waste tokens. See the repository OpenAI JSON Mode Sample for code that removes these unused IDs.

And finally, here’s the code that sends the prompt to OpenAI using the Azure OpenAI Java SDK:

record Entries(List<entry> entries) {
    record Entry(String originalWord, String wordInEnglish, String pronunciation) {}
}
</entry>

The solution works well most of the time. The LLM understands the JSON schema effectively but a word of caution: I’ve seen cases where it sometimes gets it wrong. For example, if a field is a String and its name is plural (e.g. “exampleValues”), the LLM sometimes insist on returning an array of Strings instead.

LLMs can generate remarkable outputs, sometimes exceeding the capabilities of the average person. However, it’s intriguing that, at least for now, they struggle with the more mundane task of reliably formatting their generated output.

The above is the detailed content of Simplifying Data Extraction with OpenAI JSON Mode and JSON Schemas. For more information, please follow other related articles on the PHP Chinese website!

Statement
The content of this article is voluntarily contributed by netizens, and the copyright belongs to the original author. This site does not assume corresponding legal responsibility. If you find any content suspected of plagiarism or infringement, please contact admin@php.cn
Javascript Data Types : Is there any difference between Browser and NodeJs?Javascript Data Types : Is there any difference between Browser and NodeJs?May 14, 2025 am 12:15 AM

JavaScript core data types are consistent in browsers and Node.js, but are handled differently from the extra types. 1) The global object is window in the browser and global in Node.js. 2) Node.js' unique Buffer object, used to process binary data. 3) There are also differences in performance and time processing, and the code needs to be adjusted according to the environment.

JavaScript Comments: A Guide to Using // and /* */JavaScript Comments: A Guide to Using // and /* */May 13, 2025 pm 03:49 PM

JavaScriptusestwotypesofcomments:single-line(//)andmulti-line(//).1)Use//forquicknotesorsingle-lineexplanations.2)Use//forlongerexplanationsorcommentingoutblocksofcode.Commentsshouldexplainthe'why',notthe'what',andbeplacedabovetherelevantcodeforclari

Python vs. JavaScript: A Comparative Analysis for DevelopersPython vs. JavaScript: A Comparative Analysis for DevelopersMay 09, 2025 am 12:22 AM

The main difference between Python and JavaScript is the type system and application scenarios. 1. Python uses dynamic types, suitable for scientific computing and data analysis. 2. JavaScript adopts weak types and is widely used in front-end and full-stack development. The two have their own advantages in asynchronous programming and performance optimization, and should be decided according to project requirements when choosing.

Python vs. JavaScript: Choosing the Right Tool for the JobPython vs. JavaScript: Choosing the Right Tool for the JobMay 08, 2025 am 12:10 AM

Whether to choose Python or JavaScript depends on the project type: 1) Choose Python for data science and automation tasks; 2) Choose JavaScript for front-end and full-stack development. Python is favored for its powerful library in data processing and automation, while JavaScript is indispensable for its advantages in web interaction and full-stack development.

Python and JavaScript: Understanding the Strengths of EachPython and JavaScript: Understanding the Strengths of EachMay 06, 2025 am 12:15 AM

Python and JavaScript each have their own advantages, and the choice depends on project needs and personal preferences. 1. Python is easy to learn, with concise syntax, suitable for data science and back-end development, but has a slow execution speed. 2. JavaScript is everywhere in front-end development and has strong asynchronous programming capabilities. Node.js makes it suitable for full-stack development, but the syntax may be complex and error-prone.

JavaScript's Core: Is It Built on C or C  ?JavaScript's Core: Is It Built on C or C ?May 05, 2025 am 12:07 AM

JavaScriptisnotbuiltonCorC ;it'saninterpretedlanguagethatrunsonenginesoftenwritteninC .1)JavaScriptwasdesignedasalightweight,interpretedlanguageforwebbrowsers.2)EnginesevolvedfromsimpleinterpreterstoJITcompilers,typicallyinC ,improvingperformance.

JavaScript Applications: From Front-End to Back-EndJavaScript Applications: From Front-End to Back-EndMay 04, 2025 am 12:12 AM

JavaScript can be used for front-end and back-end development. The front-end enhances the user experience through DOM operations, and the back-end handles server tasks through Node.js. 1. Front-end example: Change the content of the web page text. 2. Backend example: Create a Node.js server.

Python vs. JavaScript: Which Language Should You Learn?Python vs. JavaScript: Which Language Should You Learn?May 03, 2025 am 12:10 AM

Choosing Python or JavaScript should be based on career development, learning curve and ecosystem: 1) Career development: Python is suitable for data science and back-end development, while JavaScript is suitable for front-end and full-stack development. 2) Learning curve: Python syntax is concise and suitable for beginners; JavaScript syntax is flexible. 3) Ecosystem: Python has rich scientific computing libraries, and JavaScript has a powerful front-end framework.

See all articles

Hot AI Tools

Undresser.AI Undress

Undresser.AI Undress

AI-powered app for creating realistic nude photos

AI Clothes Remover

AI Clothes Remover

Online AI tool for removing clothes from photos.

Undress AI Tool

Undress AI Tool

Undress images for free

Clothoff.io

Clothoff.io

AI clothes remover

Video Face Swap

Video Face Swap

Swap faces in any video effortlessly with our completely free AI face swap tool!

Hot Article

Hot Tools

WebStorm Mac version

WebStorm Mac version

Useful JavaScript development tools

mPDF

mPDF

mPDF is a PHP library that can generate PDF files from UTF-8 encoded HTML. The original author, Ian Back, wrote mPDF to output PDF files "on the fly" from his website and handle different languages. It is slower than original scripts like HTML2FPDF and produces larger files when using Unicode fonts, but supports CSS styles etc. and has a lot of enhancements. Supports almost all languages, including RTL (Arabic and Hebrew) and CJK (Chinese, Japanese and Korean). Supports nested block-level elements (such as P, DIV),

MantisBT

MantisBT

Mantis is an easy-to-deploy web-based defect tracking tool designed to aid in product defect tracking. It requires PHP, MySQL and a web server. Check out our demo and hosting services.

SublimeText3 Chinese version

SublimeText3 Chinese version

Chinese version, very easy to use

ZendStudio 13.5.1 Mac

ZendStudio 13.5.1 Mac

Powerful PHP integrated development environment