Skip to content

on windows 10: error when rendering #6

@linhcentrio

Description

@linhcentrio

`(D:\Talking_head\MultiTalk-Code\venv) D:\Talking_head\MultiTalk-Code>scripts\demo.bat multi
Some weights of the model checkpoint at facebook/wav2vec2-large-xlsr-53 were not used when initializing Wav2Vec2Model: ['project_hid.weight', 'project_hid.bias', 'project_q.weight', 'project_q.bias', 'quantizer.weight_proj.weight', 'quantizer.codevectors', 'quantizer.weight_proj.bias']

  • This IS expected if you are initializing Wav2Vec2Model from the checkpoint of a model trained on another task or with another architecture (e.g. initializing a BertForSequenceClassification model from a BertForPreTraining model).
  • This IS NOT expected if you are initializing Wav2Vec2Model from the checkpoint of a model that you expect to be exactly identical (initializing a BertForSequenceClassification model from a BertForSequenceClassification model).
    => loading checkpoint 'checkpoints/stage2.pth.tar'
    => loaded checkpoint 'checkpoints/stage2.pth.tar'
    Generating facial animation for demo/input\English_WTT5UTZQ9K8_8.wav...
    Save facial animation in demo/output/English_WTT5UTZQ9K8_8.npy
    rendering: English_WTT5UTZQ9K8_8
    ffmpeg version 7.0.2 Copyright (c) 2000-2024 the FFmpeg developers
    built with clang version 18.1.8
    configuration: --prefix=/d/bld/ffmpeg_1724645294718/_h_env/Library --cc=clang.exe --cxx=clang++.exe --nm=llvm-nm --ar=llvm-ar --disable-doc --disable-openssl --enable-demuxer=dash --enable-hardcoded-tables --enable-libfreetype --enable-libharfbuzz --enable-libfontconfig --enable-libopenh264 --enable-libdav1d --ld=lld-link --target-os=win64 --enable-cross-compile --toolchain=msvc --host-cc=clang.exe --extra-libs=ucrt.lib --extra-libs=vcruntime.lib --extra-libs=oldnames.lib --strip=llvm-strip --disable-stripping --host-extralibs= --disable-libopenvino --enable-gpl --enable-libx264 --enable-libx265 --enable-libaom --enable-libsvtav1 --enable-libxml2 --enable-pic --enable-shared --disable-static --enable-version3 --enable-zlib --enable-libopus --pkg-config=/d/bld/ffmpeg_1724645294718/build_env/Library/bin/pkg-config
    libavutil 59. 8.100 / 59. 8.100
    libavcodec 61. 3.100 / 61. 3.100
    libavformat 61. 1.100 / 61. 1.100
    libavdevice 61. 1.100 / 61. 1.100
    libavfilter 10. 1.100 / 10. 1.100
    libswscale 8. 1.100 / 8. 1.100
    libswresample 5. 1.100 / 5. 1.100
    libpostproc 58. 1.100 / 58. 1.100
    [in#0 @ 000001D38581E400] Error opening input: Permission denied
    Error opening input file D:\Talking_head\MultiTalk-Code\demo\output\English_WTT5UTZQ9K8_8\tmpha02tsy
    .mp4.
    Error opening input files: Permission denied
    ffmpeg version 7.0.2 Copyright (c) 2000-2024 the FFmpeg developers
    built with clang version 18.1.8
    configuration: --prefix=/d/bld/ffmpeg_1724645294718/_h_env/Library --cc=clang.exe --cxx=clang++.exe --nm=llvm-nm --ar=llvm-ar --disable-doc --disable-openssl --enable-demuxer=dash --enable-hardcoded-tables --enable-libfreetype --enable-libharfbuzz --enable-libfontconfig --enable-libopenh264 --enable-libdav1d --ld=lld-link --target-os=win64 --enable-cross-compile --toolchain=msvc --host-cc=clang.exe --extra-libs=ucrt.lib --extra-libs=vcruntime.lib --extra-libs=oldnames.lib --strip=llvm-strip --disable-stripping --host-extralibs= --disable-libopenvino --enable-gpl --enable-libx264 --enable-libx265 --enable-libaom --enable-libsvtav1 --enable-libxml2 --enable-pic --enable-shared --disable-static --enable-version3 --enable-zlib --enable-libopus --pkg-config=/d/bld/ffmpeg_1724645294718/_build_env/Library/bin/pkg-config
    libavutil 59. 8.100 / 59. 8.100
    libavcodec 61. 3.100 / 61. 3.100
    libavformat 61. 1.100 / 61. 1.100
    libavdevice 61. 1.100 / 61. 1.100
    libavfilter 10. 1.100 / 10. 1.100
    libswscale 8. 1.100 / 8. 1.100
    libswresample 5. 1.100 / 5. 1.100
    libpostproc 58. 1.100 / 58. 1.100
    [aist#0:0/pcm_s16le @ 0000018CC3034CC0] Guessed Channel Layout: stereo
    Input #0, wav, from 'demo/input\English_WTT5UTZQ9K8_8.wav':
    Metadata:
    encoder : Lavf57.83.100
    Duration: 00:00:13.01, bitrate: 1536 kb/s
    Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 48000 Hz, stereo, s16, 1536 kb/s
    [in#1 @ 0000018CC2FEE640] Error opening input: No such file or directory
    Error opening input file demo/output/English_WTT5UTZQ9K8_8\English_WTT5UTZQ9K8_8.mp4.
    Error opening input files: No such file or directory
    Generating facial animation for demo/input\French_JATq1mUhfiA_8.wav...
    Save facial animation in demo/output/French_JATq1mUhfiA_8.npy
    rendering: French_JATq1mUhfiA_8
    ffmpeg version 7.0.2 Copyright (c) 2000-2024 the FFmpeg developers
    built with clang version 18.1.8
    configuration: --prefix=/d/bld/ffmpeg_1724645294718/_h_env/Library --cc=clang.exe --cxx=clang++.exe --nm=llvm-nm --ar=llvm-ar --disable-doc --disable-openssl --enable-demuxer=dash --enable-hardcoded-tables --enable-libfreetype --enable-libharfbuzz --enable-libfontconfig --enable-libopenh264 --enable-libdav1d --ld=lld-link --target-os=win64 --enable-cross-compile --toolchain=msvc --host-cc=clang.exe --extra-libs=ucrt.lib --extra-libs=vcruntime.lib --extra-libs=oldnames.lib --strip=llvm-strip --disable-stripping --host-extralibs= --disable-libopenvino --enable-gpl --enable-libx264 --enable-libx265 --enable-libaom --enable-libsvtav1 --enable-libxml2 --enable-pic --enable-shared --disable-static --enable-version3 --enable-zlib --enable-libopus --pkg-config=/d/bld/ffmpeg_1724645294718/_build_env/Library/bin/pkg-config
    libavutil 59. 8.100 / 59. 8.100
    libavcodec 61. 3.100 / 61. 3.100
    libavformat 61. 1.100 / 61. 1.100
    libavdevice 61. 1.100 / 61. 1.100
    libavfilter 10. 1.100 / 10. 1.100
    libswscale 8. 1.100 / 8. 1.100
    libswresample 5. 1.100 / 5. 1.100
    libpostproc 58. 1.100 / 58. 1.100
    [in#0 @ 0000023462504E40] Error opening input: Permission denied
    Error opening input file D:\Talking_head\MultiTalk-Code\demo\output\French_JATq1mUhfiA_8\tmpyie1iay7.mp4.
    Error opening input files: Permission denied
    ffmpeg version 7.0.2 Copyright (c) 2000-2024 the FFmpeg developers
    built with clang version 18.1.8
    configuration: --prefix=/d/bld/ffmpeg_1724645294718/_h_env/Library --cc=clang.exe --cxx=clang++.exe --nm=llvm-nm --ar=llvm-ar --disable-doc --disable-openssl --enable-demuxer=dash --enable-hardcoded-tables --enable-libfreetype --enable-libharfbuzz --enable-libfontconfig --enable-libopenh264 --enable-libdav1d --ld=lld-link --target-os=win64 --enable-cross-compile --toolchain=msvc --host-cc=clang.exe --extra-libs=ucrt.lib --extra-libs=vcruntime.lib --extra-libs=oldnames.lib --strip=llvm-strip --disable-stripping --host-extralibs= --disable-libopenvino --enable-gpl --enable-libx264 --enable-libx265 --enable-libaom --enable-libsvtav1 --enable-libxml2 --enable-pic --enable-shared --disable-static --enable-version3 --enable-zlib --enable-libopus --pkg-config=/d/bld/ffmpeg_1724645294718/_build_env/Library/bin/pkg-config
    libavutil 59. 8.100 / 59. 8.100
    libavcodec 61. 3.100 / 61. 3.100
    libavformat 61. 1.100 / 61. 1.100
    libavdevice 61. 1.100 / 61. 1.100
    libavfilter 10. 1.100 / 10. 1.100
    libswscale 8. 1.100 / 8. 1.100
    libswresample 5. 1.100 / 5. 1.100
    libpostproc 58. 1.100 / 58. 1.100
    [aist#0:0/pcm_s16le @ 0000020EA0101880] Guessed Channel Layout: stereo
    Input #0, wav, from 'demo/input\French_JATq1mUhfiA_8.wav':
    Metadata:
    encoder : Lavf57.83.100
    Duration: 00:00:05.74, bitrate: 1536 kb/s
    Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 48000 Hz, stereo, s16, 1536 kb/s
    [in#1 @ 0000020EA0102140] Error opening input: No such file or directory
    Error opening input file demo/output/French_JATq1mUhfiA_8\French_JATq1mUhfiA_8.mp4.
    Error opening input files: No such file or directory
    Generating facial animation for demo/input\Greek_A79aeui1HqM_15.wav...
    Save facial animation in demo/output/Greek_A79aeui1HqM_15.npy
    rendering: Greek_A79aeui1HqM_15
    ffmpeg version 7.0.2 Copyright (c) 2000-2024 the FFmpeg developers
    built with clang version 18.1.8
    configuration: --prefix=/d/bld/ffmpeg_1724645294718/_h_env/Library --cc=clang.exe --cxx=clang++.exe --nm=llvm-nm --ar=llvm-ar --disable-doc --disable-openssl --enable-demuxer=dash --enable-hardcoded-tables --enable-libfreetype --enable-libharfbuzz --enable-libfontconfig --enable-libopenh264 --enable-libdav1d --ld=lld-link --target-os=win64 --enable-cross-compile --toolchain=msvc --host-cc=clang.exe --extra-libs=ucrt.lib --extra-libs=vcruntime.lib --extra-libs=oldnames.lib --strip=llvm-strip --disable-stripping --host-extralibs= --disable-libopenvino --enable-gpl --enable-libx264 --enable-libx265 --enable-libaom --enable-libsvtav1 --enable-libxml2 --enable-pic --enable-shared --disable-static --enable-version3 --enable-zlib --enable-libopus --pkg-config=/d/bld/ffmpeg_1724645294718/_build_env/Library/bin/pkg-config
    libavutil 59. 8.100 / 59. 8.100
    libavcodec 61. 3.100 / 61. 3.100
    libavformat 61. 1.100 / 61. 1.100
    libavdevice 61. 1.100 / 61. 1.100
    libavfilter 10. 1.100 / 10. 1.100
    libswscale 8. 1.100 / 8. 1.100
    libswresample 5. 1.100 / 5. 1.100
    libpostproc 58. 1.100 / 58. 1.100
    [in#0 @ 000002AD487703C0] Error opening input: Permission denied
    Error opening input file D:\Talking_head\MultiTalk-Code\demo\output\Greek_A79aeui1HqM_15\tmpiim13oq3.mp4.
    Error opening input files: Permission denied
    ffmpeg version 7.0.2 Copyright (c) 2000-2024 the FFmpeg developers
    built with clang version 18.1.8
    configuration: --prefix=/d/bld/ffmpeg_1724645294718/_h_env/Library --cc=clang.exe --cxx=clang++.exe --nm=llvm-nm --ar=llvm-ar --disable-doc --disable-openssl --enable-demuxer=dash --enable-hardcoded-tables --enable-libfreetype --enable-libharfbuzz --enable-libfontconfig --enable-libopenh264 --enable-libdav1d --ld=lld-link --target-os=win64 --enable-cross-compile --toolchain=msvc --host-cc=clang.exe --extra-libs=ucrt.lib --extra-libs=vcruntime.lib --extra-libs=oldnames.lib --strip=llvm-strip --disable-stripping --host-extralibs= --disable-libopenvino --enable-gpl --enable-libx264 --enable-libx265 --enable-libaom --enable-libsvtav1 --enable-libxml2 --enable-pic --enable-shared --disable-static --enable-version3 --enable-zlib --enable-libopus --pkg-config=/d/bld/ffmpeg_1724645294718/_build_env/Library/bin/pkg-config
    libavutil 59. 8.100 / 59. 8.100
    libavcodec 61. 3.100 / 61. 3.100
    libavformat 61. 1.100 / 61. 1.100
    libavdevice 61. 1.100 / 61. 1.100
    libavfilter 10. 1.100 / 10. 1.100
    libswscale 8. 1.100 / 8. 1.100
    libswresample 5. 1.100 / 5. 1.100
    libpostproc 58. 1.100 / 58. 1.100
    [aist#0:0/pcm_s16le @ 0000018445C7AF40] Guessed Channel Layout: stereo
    Input #0, wav, from 'demo/input\Greek_A79aeui1HqM_15.wav':
    Metadata:
    encoder : Lavf57.83.100
    Duration: 00:00:03.33, bitrate: 1536 kb/s
    Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 48000 Hz, stereo, s16, 1536 kb/s
    [in#1 @ 0000018445C7B100] Error opening input: No such file or directory
    Error opening input file demo/output/Greek_A79aeui1HqM_15\Greek_A79aeui1HqM_15.mp4.
    Error opening input files: No such file or directory
    Generating facial animation for demo/input\Italian_72pdx3tZwto_4.wav...
    Save facial animation in demo/output/Italian_72pdx3tZwto_4.npy
    rendering: Italian_72pdx3tZwto_4
    ffmpeg version 7.0.2 Copyright (c) 2000-2024 the FFmpeg developers
    built with clang version 18.1.8
    configuration: --prefix=/d/bld/ffmpeg_1724645294718/_h_env/Library --cc=clang.exe --cxx=clang++.exe --nm=llvm-nm --ar=llvm-ar --disable-doc --disable-openssl --enable-demuxer=dash --enable-hardcoded-tables --enable-libfreetype --enable-libharfbuzz --enable-libfontconfig --enable-libopenh264 --enable-libdav1d --ld=lld-link --target-os=win64 --enable-cross-compile --toolchain=msvc --host-cc=clang.exe --extra-libs=ucrt.lib --extra-libs=vcruntime.lib --extra-libs=oldnames.lib --strip=llvm-strip --disable-stripping --host-extralibs= --disable-libopenvino --enable-gpl --enable-libx264 --enable-libx265 --enable-libaom --enable-libsvtav1 --enable-libxml2 --enable-pic --enable-shared --disable-static --enable-version3 --enable-zlib --enable-libopus --pkg-config=/d/bld/ffmpeg_1724645294718/_build_env/Library/bin/pkg-config
    libavutil 59. 8.100 / 59. 8.100
    libavcodec 61. 3.100 / 61. 3.100
    libavformat 61. 1.100 / 61. 1.100
    libavdevice 61. 1.100 / 61. 1.100
    libavfilter 10. 1.100 / 10. 1.100
    libswscale 8. 1.100 / 8. 1.100
    libswresample 5. 1.100 / 5. 1.100
    libpostproc 58. 1.100 / 58. 1.100
    [in#0 @ 000001F25B1E7740] Error opening input: Permission denied
    Error opening input file D:\Talking_head\MultiTalk-Code\demo\output\Italian_72pdx3tZwto_4\tmp2odwzby9.mp4.
    Error opening input files: Permission denied
    ffmpeg version 7.0.2 Copyright (c) 2000-2024 the FFmpeg developers
    built with clang version 18.1.8
    configuration: --prefix=/d/bld/ffmpeg_1724645294718/_h_env/Library --cc=clang.exe --cxx=clang++.exe --nm=llvm-nm --ar=llvm-ar --disable-doc --disable-openssl --enable-demuxer=dash --enable-hardcoded-tables --enable-libfreetype --enable-libharfbuzz --enable-libfontconfig --enable-libopenh264 --enable-libdav1d --ld=lld-link --target-os=win64 --enable-cross-compile --toolchain=msvc --host-cc=clang.exe --extra-libs=ucrt.lib --extra-libs=vcruntime.lib --extra-libs=oldnames.lib --strip=llvm-strip --disable-stripping --host-extralibs= --disable-libopenvino --enable-gpl --enable-libx264 --enable-libx265 --enable-libaom --enable-libsvtav1 --enable-libxml2 --enable-pic --enable-shared --disable-static --enable-version3 --enable-zlib --enable-libopus --pkg-config=/d/bld/ffmpeg_1724645294718/_build_env/Library/bin/pkg-config
    libavutil 59. 8.100 / 59. 8.100
    libavcodec 61. 3.100 / 61. 3.100
    libavformat 61. 1.100 / 61. 1.100
    libavdevice 61. 1.100 / 61. 1.100
    libavfilter 10. 1.100 / 10. 1.100
    libswscale 8. 1.100 / 8. 1.100
    libswresample 5. 1.100 / 5. 1.100
    libpostproc 58. 1.100 / 58. 1.100
    [aist#0:0/pcm_s16le @ 000002C04AC77EC0] Guessed Channel Layout: stereo
    Input #0, wav, from 'demo/input\Italian_72pdx3tZwto_4.wav':
    Metadata:
    encoder : Lavf57.83.100
    Duration: 00:00:07.57, bitrate: 1536 kb/s
    Stream #0:0: Audio: pcm_s16le ([1][0][0][0] / 0x0001), 48000 Hz, stereo, s16, 1536 kb/s
    [in#1 @ 000002C04AC7CE80] Error opening input: No such file or directory
    Error opening input file demo/output/Italian_72pdx3tZwto_4\Italian_72pdx3tZwto_4.mp4.
    Error opening input files: No such file or directory`

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions