OpenAI Archives - Jet Developers Blog

OpenAI’s Big Move: Codex Comes to macOS

Artificial intelligence is no longer just helping developers write code — it’s reshaping the entire software development process. What once required hours of manual effort is now increasingly handled by swarms of AI agents and sub-agents working behind the scenes. As developers explore new ways to collaborate with AI, even the most advanced AI labs are finding it difficult to keep pace with how fast things are moving. One of the biggest shifts right now is toward agentic software development. In this approach, AI agents don’t just assist — they work independently on coding tasks, making decisions and executing work with minimal human input.

OpenAI is now taking a significant step forward. On Monday, the company launched a new macOS app for Codex, designed to fully embrace modern agentic workflows.

The new app supports:

Multiple AI agents working in parallel
Advanced agent skills and shared state
Modern, flexible workflows inspired by the last year of experimentation in AI coding tools

This release follows closely on the heels of GPT-5.2-Codex, OpenAI’s most powerful coding model to date, launched less than two months ago. The company clearly hopes this combination of power and usability will persuade developers currently using Claude Code to switch.

“If you really want to do sophisticated work on something complex, 5.2 is the strongest model by far,”
— Sam Altman, CEO of OpenAI

Altman also acknowledged that raw capability isn’t enough — usability matters. The new macOS app aims to make that power easier and more flexible to access.

New Features Designed for Real Developers

Beyond raw performance, the Codex macOS app introduces features aimed at matching — or even surpassing — competing tools:

Background automations that run on a schedule
A review queue for completed tasks
Customizable agent personalities, ranging from pragmatic to empathetic, to suit different working styles

These features are designed to reduce context switching and help developers stay focused on higher-level thinking.

For OpenAI, the biggest advantage isn’t just intelligence — it’s speed.

“You can use this from a clean sheet of paper to build something genuinely sophisticated in just a few hours,” Altman explained.
“As fast as I can type new ideas, that’s the limit of what can get built.”

This vision captures where software development is heading: a future where human creativity sets the pace, and AI handles the heavy lifting.

What do you think — will fully agentic coding tools replace traditional development workflows, or will human-AI collaboration always need a strong human hand at the center?

GPT-4o: OpenAI’s Latest Breakthrough in AI Technology

At the recent Spring Updates event, OpenAI’s Chief Technology Officer, Mira Murati, unveiled the latest breakthrough in AI technology – the GPT-4o multimodal foundation model. This innovative model, along with the introduction of the ChatGPT desktop app, marks a significant milestone for both free and paid users.

The Power of GPT-4o: Voice, Text, and Vision Integration

“It reasons across voice, text, and vision,” Murati exclaimed, highlighting the versatility of this new model. Notably, users will soon be able to capture real-time video through their ChatGPT smartphone apps, expanding the capabilities beyond text-based interactions.

Democratizing AI: Access for All Users

OpenAI aims to demystify AI technology by making it accessible to all users. With the release of GPT-4o, free users will no longer be limited to text-only interactions but will have access to powerful image and document analysis capabilities.

Say hello to GPT-4o, our new flagship model which can reason across audio, vision, and text in real time: https://t.co/MYHZB79UqN

Text and image input rolling out today in API and ChatGPT with voice and video in the coming weeks. pic.twitter.com/uuthKZyzYx
— OpenAI (@OpenAI) May 13, 2024

Rollout Plan: From Plus to Enterprise Users

While GPT-4o will eventually be available to all ChatGPT users, the rollout will begin with paying subscribers. Plus and Team users will enjoy increased message limits, with availability for Enterprise users on the horizon.

Real-Time Responses and Emotion Detection

One of the most exciting features of GPT-4o is its ability to respond in real-time across audio inputs, detect emotions, and adjust its voice accordingly. This functionality brings a new level of naturalism to AI interactions, similar to rival AI startup Hume.

Pricing and Performance: Enhanced Efficiency

In terms of pricing and performance, It offers compelling advantages. With half the price and double the speed of GPT-4 Turbo, along with increased rate limits, developers can expect a more efficient AI solution.

in the API, GPT-4o is half the price AND twice as fast as GPT-4-turbo. and 5x rate limits. pic.twitter.com/vqV8XwNcYp
— Sam Altman (@sama) May 13, 2024

Embracing Innovation: User Adoption and Expectations

With over 100 million ChatGPT users and a thriving ecosystem of custom GPTs in the GPT Store, OpenAI is poised to revolutionize AI interactions. The recent confirmation that the mysterious “gpt2-chatbot” is indeed GPT-4o underscores the anticipation surrounding this new technology.

Conclusion: A New Era of AI Interaction

Despite some minor hiccups during live demos, the future looks promising for GPT-4o and the ChatGPT ecosystem. As users anticipate its widespread availability, the question remains: will GPT-4o redefine AI interactions and set a new standard for naturalistic experiences? Only time will tell.

OpenAI’s Model Sora unveiled First Music Video Generated

OpenAI sent shockwaves through the tech community and the arts scene earlier this year with the unveiling of their groundbreaking AI model, Sora. This innovative technology promises to revolutionize the creation of videos by producing realistic, high-resolution, and seamlessly smooth clips lasting up to 60 seconds each. Sora unveiled First Music Video, however, Sora’s debut has not been without controversy, stirring up concerns among traditional videographers and artists.

The Unveiling of Sora

In February 2024, OpenAI made waves by introducing Sora to a select audience. Although the technology remains unreleased to the public, OpenAI granted access to a small group of “red teamers” for risk assessment and a handpicked selection of visual artists, designers, and filmmakers. Despite this limited release, some early users have already begun experimenting with Sora, producing and sharing innovative projects.

The First Official Music Video with Sora

Among OpenAI’s chosen early access users is writer/director Paul Trillo, who recently made headlines by creating what is being hailed as the “first official music video made with OpenAI’s Sora.” Collaborating with indie chillwave musician Washed Out, Trillo crafted a mesmerizing 4-minute video for the single “The Hardest Part.” The video comprises a series of quick zoom shots seamlessly stitched together, creating the illusion of a continuous zoom effect.

Behind the Scenes

Trillo revealed that the concept for the video had been brewing in his mind for a decade before finally coming to fruition. He disclosed that the video consists of 55 separate clips generated by Sora from a pool of 700, meticulously edited together using Adobe Premiere.

First official commissioned music video made with @OpenAI Sora for @realwashedout

This was an idea I had almost 10 years ago and then abandoned. Finally was able to bring it to life.

Watch the full video here https://t.co/sGpmMLVCul pic.twitter.com/J3RxRD9nzo
— Paul Trillo (@paultrillo) May 2, 2024

Integration with Premiere Pro

Meanwhile, Adobe has expressed interest in incorporating Sora and other third-party AI video generator models into its Premiere Pro software. However, no timeline has been provided for this integration. Until then, users seeking to replicate Trillo’s workflow may need to generate AI video clips using third-party software like Runway or Pika before importing them into Premiere.

The Artist’s Perspective

In an interview with the Los Angeles Times, Washed Out expressed excitement about incorporating cutting-edge technology like Sora into his creative process. He highlighted the importance of exploring new tools and techniques to push the boundaries of artistic expression.

Power of Sora

Trillo’s use of Sora’s text-to-video capabilities underscores the technology’s potential in the creative landscape. By relying solely on Sora’s abilities, Trillo bypassed the need for traditional image inputs, showcasing the model’s versatility and power.

Embracing AI in Creativity

Trillo’s groundbreaking music video serves as a testament to the growing interest among creatives in harnessing AI tools to tell compelling stories. Despite criticisms of AI technology’s potential exploitation and copyright issues, many artists continue to explore its possibilities for innovation and expression.

Conclusion

As OpenAI continues to push the boundaries of AI technology with Sora, the creative community eagerly anticipates the evolution of storytelling and artistic expression in the digital age. Trillo’s pioneering work with Sora exemplifies the transformative potential of AI in the realm of media creation, paving the way for a new era of innovation and creativity.

How To Use New ChatGPT’s Memory Feature

OpenAI continues to evolve its renowned ChatGPT, introducing a slew of new features aimed at enhancing user experience and control. From memory management to temporary chats, here’s a comprehensive guide to making the most of ChatGPT’s Memory Feature latest offerings.

Unlocking ChatGPT’s Memory Feature:

ChatGPT Plus subscribers ($20 per month) can now leverage the expanded persistent memory feature, allowing them to store and recall vital information effortlessly. Learn how to utilize this feature to enhance your interactions with ChatGPT and streamline your workflow.

How to Use ChatGPT’s Memory Feature:

Discover the step-by-step process for storing information using ChatGPT’s memory feature. From inputting details to managing stored memories, we’ll walk you through the process to ensure seamless integration into your ChatGPT experience.

Important Limitations and Workarounds:

While ChatGPT’s memory feature offers enhanced functionality, it’s essential to understand its limitations. Explore the current restrictions and discover potential workarounds to maximize the utility of this feature.

Optimizing Temporary Chats for Temporary Projects:

For temporary projects or sensitive discussions, ChatGPT offers the option of starting a “temporary chat.” Learn how to initiate and manage temporary chats, ensuring privacy and security without compromising on functionality.

Accessing and Managing Chat History:

ChatGPT users now have more control over their chat history, with enhanced accessibility and management options. Explore how to access previous chats, retain chat history, and navigate through archived conversations with ease.

Empowering User Control with Data Controls:

OpenAI prioritizes user control and privacy with enhanced data controls. Discover how to manage data sharing preferences, opt-in or out of model training, and delete chat history to tailor your ChatGPT experience to your preferences.

Conclusion:

With these latest updates, OpenAI continues to empower users with greater control and functionality within ChatGPT. Whether you’re a seasoned user or new to the platform, these features offer enhanced capabilities and customization options for a seamless AI-powered interaction experience. Stay tuned for further advancements as OpenAI remains at the forefront of AI innovation.

OpenAI Partnership with Financial Times to Elevate ChatGPT’s Journalism Capabilities

OpenAI latest move involves a strategic partnership with the esteemed British news daily, Financial Times (FT), aimed at enriching the journalistic content available through ChatGPT. This collaboration signifies a concerted effort to provide users with high-quality news articles directly sourced from FT, along with relevant summaries, quotes, and links—all properly attributed, as emphasized by both parties in a recent press release.

Driving Forces Behind the Partnership

In light of recent debates surrounding AI companies’ ethical use of training data, particularly in relation to web scraping practices, OpenAI’s decision to forge partnerships with reputable publications like FT reflects a strategic pivot towards responsible data sourcing. This move comes amidst regulatory scrutiny, such as the recent fine imposed on Google by France’s competition watchdog for unauthorized use of publishers’ content in training AI models.

By partnering with FT, OpenAI aims to bolster ChatGPT’s standing as a leading AI chatbot while ensuring compliance with ethical data usage standards. Beyond content aggregation, the collaboration entails joint efforts to develop innovative AI products and features tailored to FT’s audience, potentially signaling a new era of symbiotic relationships between AI research labs and media organizations.

Perspectives from OpenAI and FT

Brad Lightcap, OpenAI’s COO, underscores the collaborative nature of the partnership, emphasizing the mutual goal of leveraging AI to enhance news delivery and reader experiences globally. Meanwhile, FT Group CEO John Ridding reaffirms the publication’s commitment to upholding journalistic integrity amidst technological advancements, emphasizing the importance of safeguarding content and brand reputation in the digital age.

Previous Partnerships and Challenges

OpenAI’s collaboration with FT follows similar partnerships with renowned media entities like Associated Press (AP), Axel Springer, and the American Journalism Project (AJP), underscoring the research lab’s ongoing efforts to diversify its training datasets responsibly. However, the journey hasn’t been without its hurdles, as evidenced by legal challenges from entities like the New York Times and multiple American publications alleging copyright infringement—a reminder of the complex legal and ethical considerations inherent in AI development.

In summary, OpenAI’s alliance with FT represents a significant step towards fostering synergy between AI technology and journalism, with the potential to shape the future of news consumption and content creation in the digital era. As both parties navigate this evolving landscape, their collaboration underscores the pivotal role of responsible data partnerships in driving AI innovation while upholding journalistic integrity.

OpenAI Empowers Personalized AI with Fine-Tuning API Enhancements

In a groundbreaking move towards personalized artificial intelligence, OpenAI unveils significant upgrades to its fine-tuning API and extends its custom models program, empowering developers with enhanced control and customization options.

Fine-Tuning API Advancements

Since its inception in August 2023, the fine-tuning API for GPT-3.5 has revolutionized AI model refinement. The latest enhancements include epoch-based checkpoint creation, minimizing retraining needs and overfitting risks. A new comparative Playground UI facilitates side-by-side evaluations, enhancing development with human insights. With third-party integration and comprehensive validation metrics, these updates mark a major leap in fine-tuning technology.

Expanding the Custom Models Program

OpenAI’s expansion of the Custom Models program offers assisted fine-tuning and fully custom-trained models, catering to organizations with specialized needs. Assisted fine-tuning leverages collaborative efforts to maximize model performance, exemplified by success stories like SK Telecom’s enhanced customer service performance. Meanwhile, fully custom-trained models address unique requirements, as seen in Harvey, an AI tool for attorneys, enhancing legal case law analysis accuracy.

The Future of AI Customization

OpenAI envisions a future where customized AI models become standard for businesses seeking optimal AI performance. With the fine-tuning API enhancements and expanded custom models program, organizations can develop AI solutions finely tuned to their specific needs, leading to enhanced outcomes and efficiency.

Getting Started

For those eager to explore these capabilities, OpenAI provides access to fine-tuning API documentation. Organizations interested in custom model collaboration can access further information on customization and partnership opportunities.

Conclusion: A New Era of Personalized AI

As AI continues to integrate into diverse sectors, OpenAI’s advancements signify a new era of customization and efficiency. These updates promise significant benefits for businesses and developers alike, paving the way for personalized AI solutions tailored to specific requirements.

One of the most intriguing aspects of OpenAI’s progress is the potential for seamless integration with existing systems. This compatibility opens the door for a wide array of applications across industries, including advanced customer service chatbots, predictive analytics tools, and automated content generation platforms.

Furthermore, the continuous evolution of OpenAI’s technology fosters a dynamic environment where businesses can harness the power of AI to drive innovation and growth. From streamlining internal processes to enhancing customer experiences, the possibilities are vast and transformative.

In essence, OpenAI’s groundbreaking developments are reshaping the business landscape, offering an array of tools and resources that empower organizations to achieve greater efficiency, productivity, and foresight. With ongoing advancements, the future holds even more promising prospects for leveraging AI to its full potential.

OpenAI Unveils Voice Engine: The Future of Voice Cloning and Text-to-Speech Technology

OpenAI expands its AI capabilities into the realm of audio with the introduction of Voice Engine. This innovative model, developed since 2022, powers OpenAI’s text-to-speech API and introduces new features like ChatGPT Voice and Read Aloud.

Revolutionizing Audio Content Creation

Voice Engine’s remarkable ability to clone human voices has significant implications for content creators across various industries, including podcasting, voice-over, gaming, customer service, and more. By generating natural-sounding speech that closely resembles the original speaker, Voice Engine opens up endless possibilities for personalized and interactive audio experiences.

Leading the Way in Accessibility

Beyond content creation, Voice Engine offers support for non-verbal individuals, providing them with unique, non-robotic voices. This breakthrough technology has the potential to revolutionize therapeutic and educational programs for individuals with speech impairments or learning needs, fostering inclusivity and accessibility.

Real-World Applications

OpenAI has already partnered with trusted organizations to test Voice Engine in real-world scenarios:

Age of Learning: Utilizes Voice Engine and GPT-4 for personalized voice content in educational programs.
HeyGen: Employs Voice Engine for video translation and multilingual avatar creation.
Dimagi: Provides interactive feedback in multiple languages for community health workers.
Livox: Integrates Voice Engine for unique voices in Augmentative and Alternative Communication (AAC) devices.
Norman Prince Neurosciences Institute: Assists individuals with neurological disorders in restoring speech using Voice Engine.

Responsible Deployment and Safety Measures

While Voice Engine holds immense potential, OpenAI is proceeding cautiously to ensure responsible deployment. The technology is currently limited to a select group of partners, with stringent safety and ethical guidelines in place to prevent misuse. OpenAI remains committed to fostering a dialogue on the ethical use of synthetic voices and continues to implement safety measures to safeguard against misuse.

As OpenAI continues to push the boundaries of AI technology, Voice Engine stands as a testament to the endless possibilities of artificial intelligence in shaping the future of audio content creation and accessibility.

ChatGPT Unveils Voice and Image Features

OpenAI’s ChatGPT Unveils New Voice and Image Features for Enhanced User Interaction

OpenAI’s ChatGPT, the AI-powered language model, is unveiling a set of exciting new features, allowing users to “see, hear, and speak.” These enhancements are designed to make ChatGPT more user-friendly and versatile, offering a variety of ways for users to interact with the AI model.

OpenAI has announced a phased rollout of voice and image capabilities within ChatGPT over the next two weeks. These features are intended to empower users to engage in voice conversations and visually convey their queries to ChatGPT, making the AI experience even more interactive and accessible.

The primary goal behind these updates is to enhance the utility and user-friendliness of ChatGPT. According to MIT Technology Review, OpenAI has been diligently refining its technology with the aim of providing a comprehensive AI solution through the ChatGPT Plus app. This puts it in direct competition with virtual assistants like Siri, Google Assistant, and Alexa.

OpenAI emphasized the significance of these new features, stating, “Voice and image give you more ways to use ChatGPT in your life. Snap a picture of a landmark while traveling and have a live conversation about what’s interesting about it.” The voice feature will be available on both iOS and Android platforms, with the option to opt-in through your settings, while the image feature will be functional across all platforms.

OpenAI went on to explain how users can leverage these capabilities: “You can now use voice to engage in a back-and-forth conversation with your assistant. Speak with it on the go, request a bedtime story for your family, or settle a dinner table debate.”

The image feature had been hinted at earlier in March when GPT-4, the model powering ChatGPT, was introduced. However, it was not accessible to the general public at the time. Now, users can upload images to the app and inquire about the content of those images, expanding the AI’s versatility.

MIT Technology Review also noted that this announcement follows the recent integration of DALL-E 3, OpenAI’s image-generation model, into ChatGPT. This integration allows users to instruct the chatbot to generate images based on their input.

Additionally, OpenAI has partnered with Be My Eyes, enabling users to ask ChatGPT questions based on images, further expanding its practical applications.

Powering the voice feature of ChatGPT, OpenAI utilized Whisper, its speech-to-text model, to convert spoken words into text, which ChatGPT can then process, enabling voice interactions with the AI software. Joanne Jang, a producer manager at OpenAI, mentioned that synthetic voices were created by training the text-to-speech model on the voices of hired actors. OpenAI is also considering the possibility of allowing users to create their own custom voices in the future.

OpenAI is taking privacy, safety, and accessibility concerns seriously with the introduction of these features. They have outlined a multifaceted approach to address these issues, including content moderation, responsible data handling, clear user guidelines, restrictions on sensitive topics, and a strong focus on ethical software use. Furthermore, OpenAI is actively collaborating with external organizations, researchers, and experts to conduct audits and assessments of the system, ensuring that ChatGPT remains a responsible and reliable tool for users.

OpenAI Launches ChatGPT Enterprise: Unleashing GPT-4’s Power for Businesses

OpenAI, led by Sam Altman, has introduced ChatGPT Enterprise, marking a significant milestone following the initial launch of their conversational AI, ChatGPT. This new enterprise-level tool heralds a major advancement, granting businesses unrestricted access to GPT-4, boasting performance twice as fast as its predecessors, according to a report by CNBC.

In the preceding year, OpenAI gained widespread recognition with the introduction of ChatGPT. This AI marvel allowed numerous users to experience the capabilities of generative artificial intelligence firsthand. Within a few months, ChatGPT garnered over 100 million active monthly users, outpacing popular platforms like Instagram and Spotify in this remarkable achievement.

Subsequently, OpenAI captured attention through its deepening collaboration with Microsoft, which generously provided substantial financial support in exchange for access to OpenAI’s advanced AI model to enhance its own suite of tools. Notably, the unveiling of ChatGPT Enterprise marks OpenAI’s first product launch since the ChatGPT Plus subscription service, which offered enhanced access to the tool’s features.

Empowering Enterprises

Delving into the specifics of ChatGPT Enterprise, as detailed in CNBC’s report, OpenAI diligently crafted this enterprise version over the span of less than a year. Collaborating with over 20 companies spanning diverse industries and sizes, OpenAI officially launched this version, bestowing enterprises with access to GPT-4 and Application Programming Interface (API) credits. OpenAI asserts that an impressive 80 percent of Fortune 500 companies currently utilize ChatGPT. The Enterprise iteration empowers these enterprises to leverage their own data for training custom models, aiming to alleviate concerns about sensitive information inadvertently being shared with OpenAI through ChatGPT usage.

Addressing these concerns, OpenAI refutes allegations of training its models on user data. To enhance data security, the Enterprise version incorporates an additional layer of encryption for client data. However, the pricing structure for this enhanced offering remains undisclosed at this time.

Racing Ahead

In terms of competitors, ChatGPT Enterprise has already garnered clients such as Block, led by Jack Dorsey, and investment group Carlyle. While an official launch date remains unspecified, OpenAI also has plans to introduce a Business version tailored to smaller companies and teams. Notably, this strategic move positions OpenAI in direct competition with its primary financier, Microsoft. The Azure OpenAI service from Microsoft has enabled businesses to access ChatGPT, but OpenAI’s independent offering could potentially save businesses costs by negating the need for a Microsoft Azure subscription.

OpenAI’s extensive operations, particularly its management of ChatGPT, involve substantial financial expenditure due to the sheer volume of requests processed each month. This prompts OpenAI to seek innovative revenue streams to sustain these services and continue refining their product line. Amidst intensifying competition in the generative AI sector, as exemplified by Anthropic’s upgraded AI model Claude and rumors of Amazon’s prospective AI offering, OpenAI is positioning itself for the enduring competition that lies ahead.

The pivotal question remains whether businesses are inclined to embrace GPT-powered decision-making in the immediate future.

New York Times Legal Action Against ChatGPT

New York Times Contemplates Legal Action Against OpenAI’s ChatGPT Over Content Usage

The potential for a legal battle is brewing as The New York Times contemplates taking legal action against OpenAI concerning its AI chatbot, ChatGPT. Sources close to the matter have informed NPR that the newspaper is considering a lawsuit due to concerns over the unauthorized use of its content by ChatGPT.

One of the primary grievances is that OpenAI is utilizing The New York Times’ stories to generate text without compensating the newspaper for the usage of its material. Furthermore, the newspaper is apprehensive that ChatGPT’s ability to furnish answers based on its reporting might diminish its online traffic by providing users with information directly, bypassing the need to visit the newspaper’s website.

These matters have led to discussions between The New York Times and OpenAI to establish a licensing agreement. Regrettably, the negotiations have soured, and the newspaper is now looking into legal options to address the situation.

Should The New York Times pursue legal action on ChatGPT, it could set a significant precedent in the nascent realm of generative AI, which involves creating novel content from existing data. ChatGPT stands as a prominent example of this technology, proficiently generating coherent and pertinent text across a range of subjects by drawing upon an extensive dataset scraped from the internet.

However, this dataset contains millions of articles from various sources, including The New York Times, obtained without proper authorization. The newspaper asserts that this unauthorized use violates its intellectual property rights and accuses ChatGPT of competing with its journalism by leveraging its stories as a knowledge source.

Generative AI’s Role in Search Engines

Particular concern revolves around the potential consequences of ChatGPT on search engines. These engines are increasingly incorporating generative AI tools to supply responses to user queries. Notably, Microsoft, a significant investor in OpenAI, employs ChatGPT to empower its Bing search engine.

The New York Times is worried that users receiving answers from ChatGPT that stem from its reporting might lose incentive to visit its official website and engage with its articles. Such a scenario could negatively impact both the newspaper’s revenue and its reputation, as shared by an anonymous source.

Within the legal landscape, the implications of generative AI remain murky due to the absence of precedents. Nevertheless, experts speculate that The New York Times might have a strong legal standing against OpenAI if it can demonstrate that ChatGPT violates its copyright.

Under U.S. law, potential repercussions for OpenAI could encompass a court order mandating the destruction of its dataset and the potential payment of up to $150,000 per infringement. Such consequences could deal a severe blow to OpenAI, compelling them to reconstruct their dataset solely from authorized content.

Daniel Gervais, an intellectual property authority from Vanderbilt University, remarked, “AI companies utilizing generative models are confronted with a very grave issue. They must exercise prudence regarding the data they employ and how they employ it, lest they face legal ramifications.”