Language AI Tasks (NLP Task Overview)
1. Understanding (Analysis)
Task | Description | Examples |
---|---|---|
Text Classification | Assign a label to text | Spam detection, sentiment analysis |
Named Entity Recognition (NER) | Extract entities like names, locations | "Barack Obama" → PERSON |
Part-of-Speech Tagging (POS) | Label each word's role | "run" = verb, "fast" = adverb |
Parsing | Analyze sentence structure | Dependency trees, constituency trees |
Coreference Resolution | Link pronouns to nouns | "Jane dropped her book. She picked it up." → "She" = Jane |
Topic Modeling | Discover themes in text | Grouping forum posts by subject |
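
As an illustration of the text classification row above, here is a minimal sketch of a spam/ham classifier using multinomial Naive Bayes with add-one smoothing. The training sentences, the whitespace tokenizer, and the function names (`train_naive_bayes`, `classify`) are assumptions made for this example; a real system would use far more data and a proper tokenizer.

```python
import math
from collections import Counter, defaultdict


def tokenize(text):
    """Lowercase and split on whitespace; a deliberately crude tokenizer."""
    return text.lower().split()


def train_naive_bayes(examples):
    """Count label frequencies and per-label word frequencies."""
    label_counts = Counter()
    word_counts = defaultdict(Counter)
    vocab = set()
    for text, label in examples:
        label_counts[label] += 1
        for token in tokenize(text):
            word_counts[label][token] += 1
            vocab.add(token)
    return label_counts, word_counts, vocab


def classify(text, model):
    """Return the label with the highest log-probability under the model."""
    label_counts, word_counts, vocab = model
    total = sum(label_counts.values())
    best_label, best_score = None, float("-inf")
    for label in sorted(label_counts):
        # Start from the log prior of the label.
        score = math.log(label_counts[label] / total)
        denom = sum(word_counts[label].values()) + len(vocab)
        for token in tokenize(text):
            # Add-one (Laplace) smoothing keeps unseen words from zeroing the score.
            score += math.log((word_counts[label][token] + 1) / denom)
        if score > best_score:
            best_label, best_score = label, score
    return best_label


# Toy training set, invented for this sketch.
TRAINING_DATA = [
    ("win a free prize now", "spam"),
    ("claim your free money", "spam"),
    ("meeting moved to friday", "ham"),
    ("lunch with the team tomorrow", "ham"),
]
model = train_naive_bayes(TRAINING_DATA)
```

Calling `classify("free prize money", model)` picks the label whose word statistics best match the input. Sentiment analysis can be set up the same way by swapping the labels.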
2. Generation
Task | Description | Examples |
---|---|---|
Text Generation | Generate new text from a prompt | GPT writing a story |
Machine Translation | Convert text between languages | English → French |
Summarization | Condense text into a shorter version | News article → summary |
Question Answering | Answer a question, often from a supplied passage | "What is photosynthesis?" |
Dialogue Generation | Generate conversational replies | Chatbots, assistants |
Code Generation | Convert text to code | "Write a Python function that..." |
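
The text generation row above can be illustrated at toy scale with a bigram model that repeatedly appends the most frequent next word. This is only a sketch of the "predict the next token, then repeat" loop, not how GPT-style models work internally; the corpus and the names `train_bigram_model` and `generate` are made up for the example.

```python
from collections import Counter, defaultdict


def train_bigram_model(corpus):
    """Count, for every word, which words follow it in the corpus."""
    followers = defaultdict(Counter)
    for sentence in corpus:
        tokens = sentence.lower().split()
        for current, nxt in zip(tokens, tokens[1:]):
            followers[current][nxt] += 1
    return followers


def generate(followers, prompt, max_new_tokens=5):
    """Greedily extend the prompt with the most frequent next word."""
    tokens = prompt.lower().split()
    if not tokens:
        return ""
    for _ in range(max_new_tokens):
        candidates = followers.get(tokens[-1])
        if not candidates:
            break  # the last word was never seen with a successor
        # Break count ties alphabetically so the output is deterministic.
        next_word = min(candidates, key=lambda w: (-candidates[w], w))
        tokens.append(next_word)
    return " ".join(tokens)


# Toy corpus, invented for this sketch.
CORPUS = [
    "the cat sat on the mat",
    "the cat chased the mouse",
    "the dog sat on the rug",
]
bigram_model = train_bigram_model(CORPUS)
```

Greedy decoding is used here for reproducibility. Sampling from the follower counts instead would give more varied output.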
3. Retrieval & Ranking
Task | Description | Examples |
---|---|---|
Information Retrieval (IR) | Retrieve relevant documents/passages | Google search, RAG systems |
Reranking | Reorder an initial list of search results by relevance | BERT-based rerankers in hybrid search systems |
Semantic Search | Find meaning-based matches | Vector search |
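
To make the retrieval row above concrete, the following sketch ranks documents against a query with a simple TF-IDF score (term frequency times a smoothed inverse document frequency, summed over the query terms). This is lexical matching, not semantic search, and it applies no length normalization. The documents and the names `build_index` and `search` are assumptions for illustration.

```python
import math
from collections import Counter


def tokenize(text):
    """Lowercase and split on whitespace."""
    return text.lower().split()


def build_index(documents):
    """Precompute term frequencies per document and an IDF weight per term."""
    term_freqs = [Counter(tokenize(doc)) for doc in documents]
    doc_freq = Counter()
    for tf in term_freqs:
        doc_freq.update(tf.keys())
    n_docs = len(documents)
    # Smoothed IDF: rarer terms get larger weights, and no weight is zero.
    idf = {
        term: math.log((1 + n_docs) / (1 + df)) + 1
        for term, df in doc_freq.items()
    }
    return term_freqs, idf


def search(query, documents, index, top_k=2):
    """Return up to top_k (score, document) pairs, best match first."""
    term_freqs, idf = index
    scored = []
    for doc, tf in zip(documents, term_freqs):
        score = sum(tf[term] * idf.get(term, 0.0) for term in tokenize(query))
        if score > 0:
            scored.append((score, doc))
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return scored[:top_k]


# Toy document collection, invented for this sketch.
DOCUMENTS = [
    "python is a programming language",
    "the python is a large snake",
    "french is a romance language",
]
index = build_index(DOCUMENTS)
```

A reranker would take the list returned by `search` and reorder it with a stronger (and usually slower) relevance model.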
4. Representation & Embedding
Task | Description | Examples |
---|---|---|
Word Embeddings | Map words to vector space | Word2Vec, GloVe |
Sentence/Doc Embeddings | Represent longer text as a single vector | Sentence-BERT, BERT [CLS] token |
Similarity Measurement | Compare two texts | Plagiarism detection |
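
Similarity measurement usually reduces to comparing two vectors. The sketch below computes cosine similarity over bag-of-words count vectors as a stand-in for learned embeddings; with Word2Vec or Sentence-BERT vectors the same formula applies, only the vectors change. The helper names `bag_of_words` and `cosine_similarity` are chosen for this example.

```python
import math
from collections import Counter


def bag_of_words(text):
    """Represent text as a sparse vector of word counts."""
    return Counter(text.lower().split())


def cosine_similarity(vec_a, vec_b):
    """Cosine of the angle between two sparse count vectors (0.0 to 1.0)."""
    # Only words present in both vectors contribute to the dot product.
    dot = sum(vec_a[t] * vec_b[t] for t in vec_a.keys() & vec_b.keys())
    norm_a = math.sqrt(sum(c * c for c in vec_a.values()))
    norm_b = math.sqrt(sum(c * c for c in vec_b.values()))
    if norm_a == 0 or norm_b == 0:
        return 0.0  # an empty text has no direction to compare
    return dot / (norm_a * norm_b)
```

For example, `cosine_similarity(bag_of_words("the cat sat"), bag_of_words("the cat ran"))` shares two of three words on each side and evaluates to about 0.667.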
5. Multimodal / Hybrid Tasks
Task | Description | Examples |
---|---|---|
Image Captioning | Describe images using text | "A cat sitting on a chair" |
Visual QA | Answer questions about an image | "What color is the shirt?" |
Code + Text | Combine natural language and programming | GitHub Copilot, doc generators |