Tag

ModelScope

1 view collected around this technical thread.

Rare Earth Juejin Tech Community
Jan 21, 2024 · Artificial Intelligence

Understanding Pretraining and Fine‑Tuning of Large Language Models: Methods, Resources, and Practical Applications

This article explains the concepts of pretraining and fine‑tuning for large language models, compares full‑parameter, LoRA and QLoRA approaches, discusses resource consumption, introduces the ModelScope SWIFT framework with code examples, and shows how fine‑tuning can improve data‑visualisation tasks while reducing token usage.

Fine-tuning, LLM, LoRA
0 likes · 24 min read
DataFunSummit
Nov 20, 2023 · Artificial Intelligence

ModelScope Agents: Open‑Source LLM Agent Framework and Practical Guide

This article introduces ModelScope Agents, an open‑source LLM‑based agent framework that addresses limitations of GPT Store, outlines its features, provides installation and usage instructions, showcases an RPG game example, and invites the community to contribute to its roadmap.

AGENT framework, AI, LLM
0 likes · 7 min read
DataFunSummit
Nov 13, 2023 · Artificial Intelligence

SWIFT: A Scalable Light‑Weight Training and Inference Framework for Efficient Model Fine‑Tuning

SWIFT is an open‑source, PyTorch‑based framework that integrates multiple efficient fine‑tuning methods such as LoRA, QLoRA, Adapter, and the in‑house ResTuning, enabling developers to fine‑tune large language and multimodal models on consumer‑grade GPUs with significantly reduced memory and compute requirements.

Fine-tuning, LoRA, ModelScope
0 likes · 13 min read
DataFunTalk
Sep 23, 2023 · Artificial Intelligence

Paraformer: An Industrial Non‑Autoregressive End‑to‑End Speech Recognition Model and Its Deployment on ModelScope

This article introduces the Paraformer non‑autoregressive end‑to‑end speech recognition model released by Alibaba DAMO Academy, details its architecture, training strategies, large‑scale performance, and provides step‑by‑step guidance for using and fine‑tuning the model on the ModelScope platform with the FunASR toolkit.

ASR, ModelScope, Paraformer
0 likes · 13 min read
DataFunSummit
Aug 24, 2023 · Artificial Intelligence

Panoramic Indoor Layout Estimation with Vision Transformer (PanoViT)

This article introduces the PanoViT model, a vision‑transformer‑based approach for indoor layout estimation from panoramic images, covering its research background, architectural components, experimental results on public datasets, and step‑by‑step usage within ModelScope.

3D reconstruction, ModelScope, computer vision
0 likes · 8 min read
DataFunTalk
Aug 14, 2023 · Artificial Intelligence

Open-Source AI Portrait Generation with FaceChain: Setup, Usage, and Underlying Principles

This article introduces the open‑source FaceChain AI portrait generation project, detailing its background, demo results, environment setup on ModelScope notebooks, step‑by‑step usage instructions, and an in‑depth explanation of the Stable Diffusion‑based LoRA training and inference pipeline.

AI Portrait, LoRA, ModelScope
0 likes · 7 min read
DataFunSummit
Jul 17, 2023 · Artificial Intelligence

Introduction to ModelScope Community's Fundamental NLP Models and Their Applications

This article introduces the ModelScope community's suite of foundational NLP models, including tokenization, POS tagging, NER, and text representation, detailing their architectures, performance, and application scenarios, while also highlighting research contributions such as the ACE framework and retrieval‑enhanced techniques.

Entity Recognition, ModelScope, NLP
0 likes · 21 min read
DataFunSummit
Jun 15, 2023 · Artificial Intelligence

Paraformer: An Industrial Non‑Autoregressive End‑to‑End Speech Recognition Model

This article introduces the Paraformer model released by Alibaba DAMO Academy on ModelScope, detailing its non‑autoregressive architecture, training strategies, performance on benchmark datasets, and step‑by‑step guidance for fine‑tuning and deploying the model using FunASR and ModelScope pipelines.

ASR, ModelScope, Paraformer
0 likes · 13 min read
DataFunSummit
Apr 21, 2023 · Artificial Intelligence

Fine‑Tuning a ViT Image Classification Model on a Small Flower Dataset Using ModelScope

This tutorial walks through the complete process of fine‑tuning a Vision Transformer (ViT) model for 14‑class flower image classification on ModelScope, covering dataset preparation, model loading, training configuration, evaluation, and inference with practical code examples.

Fine-tuning, ModelScope, ViT
0 likes · 14 min read
DataFunSummit
Apr 17, 2023 · Artificial Intelligence

Large‑Scale Table Pretraining Model SPACE‑T: Background, Architecture, and Applications

The article presents Alibaba DAMO Academy's large‑scale table pretraining model SPACE‑T, explaining the background and trends of TableQA and Text‑to‑SQL, detailing the model's design and training data, showcasing its deployment on ModelScope and Alibaba Cloud, and outlining future directions and practical impact.

AI, Large-Scale Models, ModelScope
0 likes · 11 min read
DataFunSummit
Apr 13, 2023 · Artificial Intelligence

ModelScope CV Model Overview: Visual Detection and Keypoint Applications

This article presents a comprehensive overview of ModelScope's computer‑vision models, detailing visual detection and keypoint solutions (including ViTDet, YOLOX, Res2Net, HRNet, and 3D pose models), their architectures, performance highlights, real‑world applications, and future development plans.

AI models, ModelScope, computer vision
0 likes · 11 min read
DataFunSummit
Apr 7, 2023 · Artificial Intelligence

Comprehensive Overview of OCR: Types, Models, Pre‑training Techniques, and DIY Pipelines on ModelScope

This article provides a detailed introduction to OCR technology, covering its fundamental concepts, major categories (document, scene, and handwritten OCR), typical processing pipelines, a suite of open‑source models on ModelScope—including detection, recognition, and table OCR—and recent multimodal pre‑training methods such as VLDoc and VLPT.

Document Understanding, Handwritten Recognition, ModelScope
0 likes · 15 min read
DataFunSummit
Apr 1, 2023 · Artificial Intelligence

Overview of Understanding and Editing Vision Models in the ModelScope Community

This article introduces ModelScope's community‑released visual models, detailing the categorization of understanding versus editing models, the two‑stage coarse‑to‑fine segmentation pipeline for matting, and four editing applications—style transfer, portrait beautification, skin enhancement, and anime‑style conversion—while also previewing upcoming sky‑replacement and video‑matting models.

AI, ModelScope, Visual Editing
0 likes · 9 min read