ChatGPT İstem Mühendisliği
Bu bölümde, ChatGPT için en son istem mühendisliği tekniklerini, ipuçlarını, uygulamaları, sınırlamaları, makaleleri ve ek okuma materyallerini ele alıyoruz.
Bu bölüm yoğun bir geliştirme sürecindedir.
Konular:
ChatGPT Tanıtımı
ChatGPT, OpenAI tarafından eğitilen (opens in a new tab) ve sohbet edebilme yeteneğine sahip yeni bir modeldir. Bu model, bir diyalog bağlamında uygun yanıtlar sağlamak üzere bir isteme içindeki talimatlara uyma üzerine eğitilmiştir. ChatGPT, soruları yanıtlamaya, tarif önermeye, belirli bir tarzda şarkı sözleri yazmaya, kod oluşturmaya ve çok daha fazlasına yardımcı olabilir.
ChatGPT, İnsan Geri Bildiriminden Güçlendirme Öğrenme (RLHF) kullanılarak eğitilmiştir. Bu model, önceki GPT iterasyonlarından çok daha yetenekli olmasına (ve ayrıca zararlı ve gerçeği yansıtmayan çıktıları azaltma üzerine eğitilmiş olmasına) rağmen, hala sınırlamaları vardır. Beton örneklerle bazı yetenekleri ve sınırlamaları ele alalım.
ChatGPT'nin araştırma önizlemesini burada kullanabilirsiniz ama aşağıdaki örnekler için OpenAI Playground'daki Chat
modunu kullanacağız.
Konuşma Görevinin İncelenmesi
Önceki kılavuzlardan birinde, konuşma yetenekleri ve rol belirlemeyi biraz ele aldık. LLM'yi belirli bir tarzda, belirli bir niyetle, belirli bir davranış ve kimlikle konuşma yapacak şekilde nasıl yönlendireceğimizi ele aldık.
Daha teknik ve bilimsel yanıtlar verebilen bir konuşma sistemini oluşturduğumuz önceki basit örneğimizi gözden geçirelim.
İstem:
Aşağıdaki, bir AI araştırma asistanıyla yapılan bir konuşmadır. Asistanın tonu teknik ve bilimseldir.
İnsan: Merhaba, sen kimsin?
AI: Selam! Ben bir AI araştırma asistanıyım. Bugün size nasıl yardımcı olabilirim?
İnsan: Bana kara deliklerin oluşumu hakkında bilgi verebilir misin?
AI:
Yukarıdaki örnekte, iki önemli bileşeni görebilirsiniz:
- bir niyet veya sohbet botunun ne olduğu açıklaması
- sohbet botunun yanıtları kullanma stilini veya tonunu yönlendiren kimlik
Yukarıdaki basit örnek, text-davinci-003
kullanan metin tamamlama API'leri ile iyi çalışır. Daha yakın zamanda, OpenAI gpt-3.5-turbo
olarak adlandırılan daha güçlü ve daha ucuz bir modelin (opens in a new tab) bu tür bir işlevsellik (chat tamamlama) için özellikle tasarlandığını duyurdu. Aslında OpenAI, bu tip durumlarda bile en iyi modelin bu olduğunu önermektedir. ChatGPT API'lerini kullanmanın diğer faydaları arasında maliyetin önemli ölçüde azalması (%90) ve verimliliği bulunur.
Snap Inc. ve Instacart gibi büyük şirketler, ChatGPT tarafından desteklenen, kişiselleştirilmiş önerilerden açık uçlu alışveriş hedeflerine kadar çeşitli konuşma özelliklerini ürünlerine zaten entegre ediyor.
ChatGPT ile Konuşmalar
Çok Turlu Konuşmalar
ChatGPT'nin yeteneklerini göstermeye başlamak için, yukarıdaki chatbot asistan örneğini kullanacağız ve sonuçları tartışacağız. text-davinci-003
modeline kıyasla, ChatGPT'yi güçlendiren gpt-3.5-turbo
modeli giriş olarak bir sohbet formatı kullanır. Model, bir dizi mesajı giriş olarak bekler ve bu mesajları kullanarak bir yanıt oluşturur.
Giriş:
SYSTEM: Siz bir AI araştırma asistanısınız. Teknik ve bilimsel bir ton kullanıyorsunuz.
USER: Merhaba, kim olduğunuzu öğrenebilir miyim?
ASSISTANT: Selam! Ben bir AI araştırma asistanıyım. Bugün size nasıl yardımcı olabilirim?
USER: Bana kara deliklerin oluşumu hakkında bilgi verebilir misiniz?
Çıktı:
ASSISTANT: Tabii ki! Kara delikler, kütlesi büyük bir yıldızın yakıtını tüketip kendi kütle çekimine kapılması sonucu oluşur. Bu çöküş, yıldızın inanılmaz derecede yoğun hale gelmesine ve ışığın bile kaçamadığı bu kadar güçlü bir kütle çekim alanı oluşturmasına neden olur. Dönüşü olmayan bu noktaya olay ufkı denir. Bir yıldız olay ufkunu geçtiğinde, artık bir kara delik olarak kabul edilir...
Yukarıdaki örnekte, girişi ve çıktıyı basitleştirdim ama ChatGPT sohbet tamamlama API'si mesajların belirli bir formatta olmasını gerektirir. Bu örneğin OpenAI Playground'daki Sohbet Modu
kullanılarak nasıl görüneceğinin bir ekran görüntüsünü aşağıya ekledim:
Örneğimiz için daha resmi API çağrısı aşağıdaki örneğe benzer bir şey olacaktır:
import openai
openai.ChatCompletion.create(
model="gpt-3.5-turbo",
messages=[
{"role": "system", "content": "Siz bir AI araştırma asistanısınız. Teknik ve bilimsel bir ton kullanıyorsunuz."},
{"role": "user", "content": "Merhaba, kim olduğunuzu öğrenebilir miyim?"},
{"role": "assistant", "content": "Selam! Ben bir AI araştırma asistanıyım. Bugün size nasıl yardımcı olabilirim?"},
{"role": "user", "content": "Bana kara deliklerin oluşumu hakkında bilgi verebilir misiniz?"}
]
)
Aslında, gelecekte geliştiricilerin ChatGPT ile etkileşime girme şeklinin [Chat Markup Language] (https://github.com/openai/openai-python/blob/main/chatml.md (opens in a new tab)) (kısaca ChatML) aracılığıyla yapılması beklenmektedir.
Referanslar
- Column Type Annotation using ChatGPT (opens in a new tab) (June 2023)
- Enhancing Programming eTextbooks with ChatGPT Generated Counterfactual-Thinking-Inspired Questions (opens in a new tab) (June 2023)
- ChatGPT an ENFJ, Bard an ISTJ: Empirical Study on Personalities of Large Language Models (opens in a new tab) (May 2023)
- A Systematic Study and Comprehensive Evaluation of ChatGPT on Benchmark Datasets (opens in a new tab) (May 2023)
- Chatbots put to the test in math and logic problems: A preliminary comparison and assessment of ChatGPT-3.5, ChatGPT-4, and Google Bard (opens in a new tab) (May 2023)
- GPT Models in Construction Industry: Opportunities, Limitations, and a Use Case Validation (opens in a new tab) (May 2023)
- Fairness of ChatGPT (opens in a new tab) (May 2023)
- Mapping ChatGPT in Mainstream Media: Early Quantitative Insights through Sentiment Analysis and Word Frequency Analysis (opens in a new tab) (May 2023)
- A Survey on ChatGPT: AI-Generated Contents, Challenges, and Solutions (opens in a new tab) (May 2023)
- Do Language Models Know When They're Hallucinating References? (opens in a new tab) (May 2023)
- [HowkGPT: Investigating the Detection of ChatGPT-generated University Student Homework through Context-Aware Perplexity Analysis]
- Playing repeated games with Large Language Models (opens in a new tab) (May 2023)
- Zero is Not Hero Yet: Benchmarking Zero-Shot Performance of LLMs for Financial Tasks (opens in a new tab) (May 2023)
- Leveraging LLMs for KPIs Retrieval from Hybrid Long-Document: A Comprehensive Framework and Dataset (opens in a new tab) (May 2023)
- Marked Personas: Using Natural Language Prompts to Measure Stereotypes in Language Models (opens in a new tab) (May 2023)
- The Larger They Are, the Harder They Fail: Language Models do not Recognize Identifier Swaps in Python (opens in a new tab) (May 2023)
- InternGPT: Solving Vision-Centric Tasks by Interacting with ChatGPT Beyond Language (opens in a new tab) (May 2023)
- Narrative XL: A Large-scale Dataset For Long-Term Memory Models (opens in a new tab) (May 2023)
- Does ChatGPT have Theory of Mind? (opens in a new tab) (May 2023)
- Can LLM Already Serve as A Database Interface? A BIg Bench for Large-Scale Database Grounded Text-to-SQLs (opens in a new tab) (May 2023)
- ZeroSCROLLS: A Zero-Shot Benchmark for Long Text Understanding (opens in a new tab) (May 2023)
- Navigating Prompt Complexity for Zero-Shot Classification: A Study of Large Language Models in Computational Social Science (opens in a new tab) (May 2023)
- ChatGPT-EDSS: Empathetic Dialogue Speech Synthesis Trained from ChatGPT-derived Context Word Embeddings (opens in a new tab) (May 2023)
- Can LLMs facilitate interpretation of pre-trained language models? (opens in a new tab) (May 2023)
- Can ChatGPT Detect Intent? Evaluating Large Language Models for Spoken Language Understanding (opens in a new tab) (May 2023)
- LLM-empowered Chatbots for Psychiatrist and Patient Simulation: Application and Evaluation (opens in a new tab) (May 2023)
- ChatGPT as your Personal Data Scientist (opens in a new tab) (May 2023)
- Are Large Language Models Good Evaluators for Abstractive Summarization? (opens in a new tab) (May 2023)
- Can ChatGPT Defend the Truth? Automatic Dialectical Evaluation Elicits LLMs' Deficiencies in Reasoning (opens in a new tab) (May 2023)
- Evaluating ChatGPT's Performance for Multilingual and Emoji-based Hate Speech Detection (opens in a new tab) (May 2023)
- ChatGPT to Replace Crowdsourcing of Paraphrases for Intent Classification: Higher Diversity and Comparable Model Robustness (opens in a new tab) (May 2023)
- Distilling ChatGPT for Explainable Automated Student Answer Assessment (opens in a new tab) (May 2023)
- Prompt ChatGPT In MNER: Improved multimodal named entity recognition method based on auxiliary refining knowledge from ChatGPT (opens in a new tab) (May 2023)
- ChatGPT Is More Likely to Be Perceived as Male Than Female (opens in a new tab) (May 2023)
- Observations on LLMs for Telecom Domain: Capabilities and Limitations (opens in a new tab) (May 2023)
- Bits of Grass: Does GPT already know how to write like Whitman? (opens in a new tab) (May 2023)
- Are Large Language Models Fit For Guided Reading? (opens in a new tab) (May 2023)
- ChatGPT Perpetuates Gender Bias in Machine Translation and Ignores Non-Gendered Pronouns: Findings across Bengali and Five other Low-Resource Languages (opens in a new tab) (May 2023)
- BAD: BiAs Detection for Large Language Models in the context of candidate screening (opens in a new tab) (May 2023)
- MemoryBank: Enhancing Large Language Models with Long-Term Memory (opens in a new tab) (May 2023)
- Knowledge Graph Completion Models are Few-shot Learners: An Empirical Study of Relation Labeling in E-commerce with LLMs (opens in a new tab) (May 2023)
- A Preliminary Analysis on the Code Generation Capabilities of GPT-3.5 and Bard AI Models for Java Functions (opens in a new tab) (May 2023)
- ChatGPT-4 Outperforms Experts and Crowd Workers in Annotating Political Twitter Messages with Zero-Shot Learning (opens in a new tab) (April 2023)
- ChatGPT Beyond English: Towards a Comprehensive Evaluation of Large Language Models in Multilingual Learning (opens in a new tab) (April 2023)
- Distinguishing ChatGPT(-3.5, -4)-generated and human-written papers through Japanese stylometric analysis (opens in a new tab) (April 2023)
- Zero-shot Temporal Relation Extraction with ChatGPT (opens in a new tab) (April 2023)
- Can ChatGPT and Bard Generate Aligned Assessment Items? A Reliability Analysis against Human Performance (opens in a new tab) (April 2023)
- Are Large Language Models Ready for Healthcare? A Comparative Study on Clinical Language Understanding (opens in a new tab) (April 2023)
- The Wall Street Neophyte: A Zero-Shot Analysis of ChatGPT Over MultiModal Stock Movement Prediction Challenges (opens in a new tab) (April 2023)
- Toxicity in ChatGPT: Analyzing Persona-assigned Language Models (opens in a new tab) (April 2023)
- Multi-step Jailbreaking Privacy Attacks on ChatGPT (opens in a new tab) (April 2023)
- Is ChatGPT a Good Sentiment Analyzer? A Preliminary Study (opens in a new tab) (April 2023)
- A Preliminary Evaluation of ChatGPT for Zero-shot Dialogue Understanding (opens in a new tab) (April 2023)
- Extractive Summarization via ChatGPT for Faithful Summary Generation (opens in a new tab) (April 2023)
- What does ChatGPT return about human values? Exploring value bias in ChatGPT using a descriptive value theory (opens in a new tab) (April 2023)
- On the Evaluations of ChatGPT and Emotion-enhanced Prompting for Mental Health Analysis (opens in a new tab) (April 2023)
- ChatGPT-Crawler: Find out if ChatGPT really knows what it's talking about (opens in a new tab) (April 2023)
- Should ChatGPT be Biased? Challenges and Risks of Bias in Large Language Models (opens in a new tab) (April 2023)
- Synthesis of Mathematical programs from Natural Language Specifications (opens in a new tab) (April 2023)
- Large language models effectively leverage document-level context for literary translation, but critical errors persist (opens in a new tab) (April 2023)
- Investigating Chain-of-thought with ChatGPT for Stance Detection on Social Media (opens in a new tab) (April 2023)
- ChatGPT for Shaping the Future of Dentistry: The Potential of Multi-Modal Large Language Model (opens in a new tab) (April 2023)
- Can Large Language Models Play Text Games Well? Current State-of-the-Art and Open Questions (opens in a new tab) (April 2023)
- Human-like Summarization Evaluation with ChatGPT (opens in a new tab) (April 2023)
- Evaluation of ChatGPT Family of Models for Biomedical Reasoning and Classification (opens in a new tab) (April 2023)
- Comparative Analysis of CHATGPT and the evolution of language models (opens in a new tab) (April 2023)
- Unleashing the Power of ChatGPT for Translation: An Empirical Study (opens in a new tab) (April 2023)
- Geotechnical Parrot Tales (GPT): Overcoming GPT hallucinations with prompt engineering for geotechnical applications (opens in a new tab) (April 2023)
- Unlocking the Potential of ChatGPT: A Comprehensive Exploration of its Applications, Advantages, Limitations, and Future Directions in Natural Language Processing (opens in a new tab) (April 2023)
- Summary of ChatGPT/GPT-4 Research and Perspective Towards the Future of Large Language Models (opens in a new tab) (April 2023)
- Is ChatGPT a Highly Fluent Grammatical Error Correction System? A Comprehensive Evaluation (opens in a new tab) (April 2023)
- Safety Analysis in the Era of Large Language Models: A Case Study of STPA using ChatGPT (opens in a new tab) (April 2023)
- Large language models can rate news outlet credibility (opens in a new tab) (April 2023)
- Can AI Chatbots Pass the Fundamentals of Engineering (FE) and Principles and Practice of Engineering (PE) Structural Exams? (opens in a new tab) (April 2023)
- Can AI Put Gamma-Ray Astrophysicists Out of a Job? (opens in a new tab) (March 2023)
- Comparing Abstractive Summaries Generated by ChatGPT to Real Summaries Through Blinded Reviewers and Text Classification Algorithms (opens in a new tab) (March 2023)
- HuggingGPT: Solving AI Tasks with ChatGPT and its Friends in HuggingFace (opens in a new tab) (March 2023)
- SelfCheckGPT: Zero-Resource Black-Box Hallucination Detection for Generative Large Language Models (opens in a new tab) (March 2023)
- WavCaps: A ChatGPT-Assisted Weakly-Labelled Audio Captioning Dataset for Audio-Language Multimodal Research (opens in a new tab) (March 2023)
- How well do Large Language Models perform in Arithmetic tasks? (opens in a new tab) (March 2023)
- Assessing Cross-Cultural Alignment between ChatGPT and Human Societies: An Empirical Study (opens in a new tab) (March 2023)
- Yes but.. Can ChatGPT Identify Entities in Historical Documents? (opens in a new tab) (March 2023)
- Evaluation of ChatGPT for NLP-based Mental Health Applications (opens in a new tab) (March 2023)
- A Perspectival Mirror of the Elephant: Investigating Language Bias on Google, ChatGPT, Wikipedia, and YouTube (opens in a new tab) (March 2023)
- ChatGPT or academic scientist? Distinguishing authorship with over 99% accuracy using off-the-shelf machine learning tools (opens in a new tab) (March 2023)
- Zero-shot Clinical Entity Recognition using ChatGPT (opens in a new tab) (March 2023)
- ChatGPT is a Knowledgeable but Inexperienced Solver: An Investigation of Commonsense Problem in Large Language Models (opens in a new tab) (March 2023)
- ChatGPT4PCG Competition: Character-like Level Generation for Science Birds (opens in a new tab) (March 2023)
- ChatGPT as a Factual Inconsistency Evaluator for Abstractive Text Summarization (opens in a new tab) (March 2023)
- Chat-REC: Towards Interactive and Explainable LLMs-Augmented Recommender System (opens in a new tab) (March 2023)
- A comprehensive evaluation of ChatGPT's zero-shot Text-to-SQL capability (opens in a new tab) (March 2023)
- Towards Making the Most of ChatGPT for Machine Translation (opens in a new tab) (March 2023)
- Error Analysis Prompting Enables Human-Like Translation Evaluation in Large Language Models: A Case Study on ChatGPT (opens in a new tab) (March 2023)
- ChatGPT Outperforms Crowd-Workers for Text-Annotation Tasks (opens in a new tab) (March 2023)
- ChatGPT or Grammarly? Evaluating ChatGPT on Grammatical Error Correction Benchmark (opens in a new tab) (March 2023)
- ChatGPT and a New Academic Reality: AI-Written Research Papers and the Ethics of the Large Language Models in Scholarly Publishing (opens in a new tab) (March 2023)
- Are LLMs the Master of All Trades? : Exploring Domain-Agnostic Reasoning Skills of LLMs (opens in a new tab) (March 2023)
- Is ChatGPT A Good Keyphrase Generator? A Preliminary Study (opens in a new tab) (March 2023)
- MM-REACT: Prompting ChatGPT for Multimodal Reasoning and Action (opens in a new tab) (March 2023)
- Large Language Models Can Be Used to Estimate the Ideologies of Politicians in a Zero-Shot Learning Setting (opens in a new tab) (March 2023)
- Chinese Intermediate English Learners outdid ChatGPT in deep cohesion: Evidence from English narrative writing (opens in a new tab) (March 2023)
- A Comprehensive Capability Analysis of GPT-3 and GPT-3.5 Series Models (opens in a new tab) (March 2023)
- ChatGPT as the Transportation Equity Information Source for Scientific Writing (opens in a new tab) (March 2023)
- Translating Radiology Reports into Plain Language using ChatGPT and GPT-4 with Prompt Learning: Promising Results, Limitations, and Potential (opens in a new tab) (March 2023)
- ChatGPT Participates in a Computer Science Exam (opens in a new tab) (March 2023)
- Consistency Analysis of ChatGPT (opens in a new tab) (Mar 2023)
- Algorithmic Ghost in the Research Shell: Large Language Models and Academic Knowledge Creation in Management Research (opens in a new tab) (Mar 2023)
- Large Language Models in the Workplace: A Case Study on Prompt Engineering for Job Type Classification (opens in a new tab) (March 2023)
- Seeing ChatGPT Through Students' Eyes: An Analysis of TikTok Data (opens in a new tab) (March 2023)
- Extracting Accurate Materials Data from Research Papers with Conversational Language Models and Prompt Engineering -- Example of ChatGPT (opens in a new tab) (Mar 2023)
- ChatGPT is on the horizon: Could a large language model be all we need for Intelligent Transportation? (opens in a new tab) (Mar 2023)
- Making a Computational Attorney (opens in a new tab) (Mar 2023)
- Does Synthetic Data Generation of LLMs Help Clinical Text Mining? (opens in a new tab) (Mar 2023)
- MenuCraft: Interactive Menu System Design with Large Language Models (opens in a new tab) (Mar 2023)
- A Comprehensive Survey of AI-Generated Content (AIGC): A History of Generative AI from GAN to ChatGPT (opens in a new tab) (Mar 2023)
- Exploring the Feasibility of ChatGPT for Event Extraction (opens in a new tab)
- ChatGPT: Beginning of an End of Manual Annotation? Use Case of Automatic Genre Identification (opens in a new tab) (Mar 2023)
- Is ChatGPT a Good NLG Evaluator? A Preliminary Study (opens in a new tab) (Mar 2023)
- Will Affective Computing Emerge from Foundation Models and General AI? A First Evaluation on ChatGPT (opens in a new tab) (Mar 2023)
- UZH_CLyp at SemEval-2023 Task 9: Head-First Fine-Tuning and ChatGPT Data Generation for Cross-Lingual Learning in Tweet Intimacy Prediction (opens in a new tab) (Mar 2023)
- How to format inputs to ChatGPT models (opens in a new tab) (Mar 2023)
- Can ChatGPT Assess Human Personalities? A General Evaluation Framework (opens in a new tab) (Mar 2023)
- Cross-Lingual Summarization via ChatGPT (opens in a new tab) (Feb 2023)
- ChatAug: Leveraging ChatGPT for Text Data Augmentation (opens in a new tab) (Feb 2023)
- Dr ChatGPT, tell me what I want to hear: How prompt knowledge impacts health answer correctness (opens in a new tab) (Feb 2023)
- An Independent Evaluation of ChatGPT on Mathematical Word Problems (MWP) (opens in a new tab) (Feb 2023)
- ChatGPT: A Meta-Analysis after 2.5 Months (opens in a new tab) (Feb 2023)
- Let's have a chat! A Conversation with ChatGPT: Technology, Applications, and Limitations (opens in a new tab) (Feb 2023)
- Check Your Facts and Try Again: Improving Large Language Models with External Knowledge and Automated Feedback (opens in a new tab) (Feb 2023)
- On the Robustness of ChatGPT: An Adversarial and Out-of-distribution Perspective (opens in a new tab) (Feb 2023)
- How Generative AI models such as ChatGPT can be (Mis)Used in SPC Practice, Education, and Research? An Exploratory Study (opens in a new tab) (Feb 2023)
- Can ChatGPT Understand Too? A Comparative Study on ChatGPT and Fine-tuned BERT (opens in a new tab) (Feb 2023)
- A Prompt Pattern Catalog to Enhance Prompt Engineering with ChatGPT (opens in a new tab) (Feb 2023)
- Zero-Shot Information Extraction via Chatting with ChatGPT (opens in a new tab) (Feb 2023)
- ChatGPT: Jack of all trades, master of none (opens in a new tab) (Feb 2023)
- A Pilot Evaluation of ChatGPT and DALL-E 2 on Decision Making and Spatial Reasoning (opens in a new tab) (Feb 2023)
- Netizens, Academicians, and Information Professionals' Opinions About AI With Special Reference To ChatGPT (opens in a new tab) (Feb 2023)
- Linguistic ambiguity analysis in ChatGPT (opens in a new tab) (Feb 2023)
- ChatGPT versus Traditional Question Answering for Knowledge Graphs: Current Status and Future Directions Towards Knowledge Graph Chatbots (opens in a new tab) (Feb 2023)
- What ChatGPT and generative AI mean for science (opens in a new tab) (Feb 2023)
- Applying BERT and ChatGPT for Sentiment Analysis of Lyme Disease in Scientific Literature (opens in a new tab) (Feb 2023)
- Exploring AI Ethics of ChatGPT: A Diagnostic Analysis (opens in a new tab) (Jan 2023)
- ChatGPT for Good? On Opportunities and Challenges of Large Language Models for Education (opens in a new tab) (Jan 2023)
- The political ideology of conversational AI: Converging evidence on ChatGPT's pro-environmental, left-libertarian orientation (opens in a new tab) (Jan 2023)
- Techniques to improve reliability - OpenAI Cookbook (opens in a new tab)
- Awesome ChatGPT Prompts (opens in a new tab)
- Introducing ChatGPT (opens in a new tab) (Nov 2022)