top of page

20231209 AIGC数字人生成探索及测试结果

AIGC Digital Human Generation Exploration and Test Results

ROLE

AIGC Engineer

DESCRIPTION

Developed a comprehensive live streaming digital human system by integrating large language models, voice synthesis, and lip-sync generation technologies, enabling real-time user interaction and character-based responses.

Created a video digital human generation system, focusing on high-fidelity lip-syncing and facial expression transfers using AI-driven technology, ensuring consistency in visual and emotional expression.

Optimized the integration of AI technologies such as facial expression transfer and lip-sync drivers, ensuring smooth transitions between actions and emotions, while enhancing the digital human's interaction capabilities.

Refined motion and pose transfer tools, improving the ability to animate characters with more natural movements, integrating seamlessly with real-time video content.

YEAR

2024

GENRE

  • Live Streaming Digital Human System

  • AI-Driven Facial Expression Transfer

  • Lip Sync Driver System

  • Motion and Pose Transfer System

  • Tone Color Variation with SO-VITS-SVC

  • AI-Generated Mandarin and Lip Sync Integration

PLATFORM

20250201 - Live Streaming Digital Human Production Technology

By integrating large language models, voice synthesis, and lip-sync generation technologies, a comprehensive live streaming digital human system was developed. This system is capable of interacting with users, answering questions based on the character design.

20250201 
Keling Lip Sync - Video Digital Human Generation

20240806 - LivePortrait - Facial Expression Transfer

Installation method: Standalone software, CG client.
A tool that accurately mimics human faces to generate AI-driven digital human videos, with high fidelity to facial expressions.

20240806 - SadTalker Lip Sync Driver System

Supports eye blinking and image-based talking digital humans.
Disadvantages: Output quality is suboptimal, and it cannot generate body movements. Post-production compositing is required for adjustments.

20240806 - MiniCMotion - Motion and Pose Transfer

20240322 - Tone Color Variation Test

The SO-VITS-SVC voice generation model was fine-tuned with Lora to enable tone color variation. This model can alter the tone color based on the pitch of singing.

20231205 - AI-Generated Mandarin and Lip Sync

bottom of page