Skip to content

Unicode Decoding Error #45

@gpavanb-osi

Description

@gpavanb-osi

Hi

I am getting the following error when trying out the code on a resume in English.

Error
UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 481: character maps to

Traceback
Traceback (most recent call last):
File "main.py", line 116, in
main()
File "main.py", line 33, in main
observations = extract()
File "main.py", line 68, in extract
observations['text'] = observations['file_path'].apply(lib.convert_pdf)
File "C:\Users\pgovindaraju\Desktop\Python_Projects\Talent-Acquisition\venv\lib\site-packages\pandas\core\series.py", line 3591, in apply
mapped = lib.map_infer(values, f, convert=convert_dtype)
File "pandas_libs\lib.pyx", line 2217, in pandas._libs.lib.map_infer
File "C:\Users\pgovindaraju\Desktop\Python_Projects\Talent-Acquisition\ResumeParser\bin\lib.py", line 140, in convert_pdf
return open(output_filepath).read()
File "C:\Users\pgovindaraju\Desktop\Python_Projects\Talent-Acquisition\venv\lib\encodings\cp1252.py", line 23, in decode
return codecs.charmap_decode(input,self.errors,decoding_table)[0]
UnicodeDecodeError: 'charmap' codec can't decode byte 0x81 in position 481: character maps to

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions