Podcast Avatar

Google NotebookLM can quickly summarize PDF files, websites, YouTube videos, audio files, Google Docs, and Google Slides, and produce high-quality podcast audio in a two-person dialogue format. Podcast Avatar is an all-in-one AI video generation application developed by TalkingAvatar.ai that turns such two-person podcast-style audio into video. It lets users quickly combine their imported videos with two-person podcast-style audio to create new podcast videos.

Currently, Podcast Avatar is available only in the Windows app.

Operation Process

Podcast Avatar offers two methods for generating Podcast Videos:

  • Import a video with two people and a two-person podcast audio. The AI will diarize the speakers and match the audio to the corresponding video characters.
  • Import two single-person videos and a two-person podcast audio, with the AI matching the audio to the appropriate speakers.
In both cases, the workflow is similar for the user.

Step 1: Select the character video you want to edit and add it to the video track. You can either upload a local video or choose one from the free Library Avatar market provided by the platform.

Step 2: Upload the two-person podcast-style audio and add it to the audio timeline. Once added, the application splits the audio into two audio tracks, one per speaker, and automatically matches each track to the video character it should be synced with. If the video contains multiple character avatars, you will need to select the avatar for each audio track.
(Note: The first time you upload an audio file, diarization may take longer than usual, so please be patient.)
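
To make the split in Step 2 more concrete, here is a minimal Python sketch of how diarized segments could be grouped into one track per speaker and then paired with avatars. It is an illustration only, not Podcast Avatar's actual implementation: the segment format, the speaker labels, the avatar names, and both function names are hypothetical, and pairing by order of first appearance is an assumed rule.

```python
from collections import defaultdict


def split_by_speaker(segments):
    """Group (speaker, start, end) segments into one track per speaker.

    The segment tuple layout is an assumption made for this sketch.
    """
    tracks = defaultdict(list)
    for speaker, start, end in segments:
        if end <= start:
            raise ValueError(f"segment for {speaker} has non-positive duration")
        tracks[speaker].append((start, end))
    # Keep each track in chronological order.
    return {speaker: sorted(spans) for speaker, spans in tracks.items()}


def assign_avatars(tracks, avatars):
    """Pair each speaker track with an avatar, in order of first appearance.

    This ordering rule is hypothetical; in the app the user can choose
    the avatar for each audio track manually.
    """
    if len(tracks) != len(avatars):
        raise ValueError("need exactly one avatar per speaker track")
    ordered = sorted(tracks, key=lambda speaker: tracks[speaker][0][0])
    return dict(zip(ordered, avatars))


# Hypothetical diarization output: (speaker label, start second, end second).
segments = [
    ("speaker_a", 0.0, 4.2),
    ("speaker_b", 4.2, 9.0),
    ("speaker_a", 9.0, 12.5),
]
tracks = split_by_speaker(segments)
mapping = assign_avatars(tracks, ["avatar_left", "avatar_right"])
```

In this sketch each speaker ends up with a list of time spans, which mirrors the two audio tracks shown on the timeline, and the mapping records which avatar each track should drive.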

Step 3: Additional configurations

  • AI Version:
    • AI Version 1.0: The initial algorithm model used in the 1.0 client. It produces noticeable lip movements.
    • AI Version 1.3: The latest algorithm model used in the 1.3 client. It offers clearer output with smoother lip-sync effects.
      AI Version 1.3 performs better than 1.0 in most scenarios.
  • Face Enhance:
    • The Face Enhancer can improve the resolution of the face, making facial features clearer and more detailed.

When there is only one video on the video timeline, the application allows you to use the "Save as My Avatar" function. This saves the video along with its currently linked voice model as a quick preset. Note: Audio files on the audio timeline will not be saved. Once the preset is successfully saved, you can find it in the Avatar Library - My Avatar section of the application.

Important Notes

Video Guidelines

  • Video Requirements

    • Format: Import MP4 video files with a recommended resolution of at least 360p to ensure good final output quality.
    • Lighting: Avoid strong light or shadows; ensure the subject's face is clearly visible.
    • Content: Use stable footage with minimal shaking or rapid movements.
    • Subjects: The subject's facial expressions in the video should be natural and easy to adapt.
  • Legal and Copyright Requirements

    • Usage Authorization: Ensure that all video and audio materials used have proper legal authorization to avoid violating copyright or image rights.
    • Privacy Protection: If the video involves real people or their voices, obtain prior consent or confirm the legal use of the material.
  • Operational Notes (App)

    • Videos on the track can be split, trimmed, deleted, undone, or restored.
    • When importing multiple videos onto the track, ensure they have the same resolution and aspect ratio. Additionally, using videos with a consistent frame rate is recommended to prevent stuttering or lag during output.
  • Common Issues and Solutions

    • Issue 1: Overly dark shadows around the face can lead to unstable lip-syncing.
    • Issue 2: Poor clarity, obstructions, or side profile angles exceeding 20° may result in subpar output.
    • Issue 3: Rapid body movements or continuous head shaking can negatively affect the output.
    • Issue 4: Subjects with heavy facial hair may experience reduced lip-syncing accuracy.
    • Issue 5: Very thick lips may result in poor lip-syncing performance.
    • Issue 6: The application currently has limited support for cartoon or animated characters.
    • Issue 7: The application does not currently support lip-syncing for animal characters.
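
The format and track requirements above (MP4, at least 360p, matching resolution and frame rate across clips) can be checked before importing. The following Python sketch shows one possible pre-flight check. It is not part of Podcast Avatar: the `Clip` record and `check_clips` function are hypothetical, the metadata is assumed to be supplied by the caller (for example, read with a separate media tool), and "360p" is interpreted here as the shorter side being at least 360 pixels.

```python
from dataclasses import dataclass

MIN_SHORT_SIDE = 360  # assumed reading of the "at least 360p" recommendation


@dataclass(frozen=True)
class Clip:
    """Hypothetical metadata record for one video clip."""
    name: str
    container: str  # file format, e.g. "mp4"
    width: int
    height: int
    fps: float


def check_clips(clips):
    """Return a list of human-readable problems; an empty list means all checks passed."""
    problems = []
    for clip in clips:
        if clip.container.lower() != "mp4":
            problems.append(f"{clip.name}: not an MP4 file")
        if min(clip.width, clip.height) < MIN_SHORT_SIDE:
            problems.append(f"{clip.name}: resolution below {MIN_SHORT_SIDE}p")
    # Identical width and height also implies an identical aspect ratio.
    if len({(c.width, c.height) for c in clips}) > 1:
        problems.append("clips differ in resolution")
    # A consistent frame rate is recommended to prevent stuttering in the output.
    if len({round(c.fps, 2) for c in clips}) > 1:
        problems.append("clips differ in frame rate")
    return problems


# Example with made-up clips.
clips = [
    Clip("host.mp4", "mp4", 1280, 720, 30.0),
    Clip("guest.mov", "mov", 640, 360, 25.0),
]
problems = check_clips(clips)
```

A check like this only covers the measurable requirements. Lighting, stability, face angle, and the other issues listed above still need to be judged by eye.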

Audio Guidelines
