Channel: Planet Python

↧

Python Morsels: Unicode character encodings

May 2, 2022, 8:00 am

≫ Next: Glyph Lefkowitz: Inbox Zero, Cost: Zero

≪ Previous: Real Python: Python's min() and max(): Find Smallest and Largest Values

When working with text files in Python, it's considered a best practice to specify the character encoding that you're working with.

Table of contents

All input starts as raw bytes

When you open a file in Python, the default mode is r or rt, for read text mode:

>>> withopen("my_file.txt")asf:... contents=f.read()...>>> f.mode'r'

>>> withopen("my_file.txt")asf:... contents=f.read()...>>> f.mode'r'

Meaning when we read our file, we'll get back strings that represent text:

>>> contents'This is a file ✨\n'

>>> contents'This is a file ✨\n'

But that's not what Python actually reads from disk.

If we open a file with the mode rb and read from our file we'll see what Python sees; that is bytes:

>>> withopen("my_file.txt",mode="rb")asf:... contents=f.read()...>>> contentsb'This is a file \xe2\x9c\xa8\n'>>> type(contents)<class 'bytes'>

>>> withopen("my_file.txt",mode="rb")asf:... contents=f.read()...>>> contentsb'This is a file \xe2\x9c\xa8\n'>>> type(contents)<class 'bytes'>

Bytes are what Python decodes to make strings.

Encoding strings into bytes

If you have a string …

Read the full article: https://www.pythonmorsels.com/unicode-character-encodings-in-python/

↧

Trending Articles

RAMAYAMPET Mandal Sarpanch | Upa-Sarpanch | Ward member Mobile Numbers Medak...

May 24, 2017, 2:00 am

लड़कियां सेक्स के दौरान क्यों करती है उह! आह!लड़कियां सेक्स के दौरान क्यों करती...

May 19, 2016, 1:54 am

Neem Baba Extra Questions Answer Class 6 English Poorvi

February 1, 2025, 5:19 am

Throw Back: 4×4 — Sikilitele (Ft Castro) Prod by JQ

March 5, 2015, 8:24 am

Rajasthan Board 10th Result 2016 Roll No wise & Name Wise

August 20, 2016, 5:13 pm

Lowe faces four theft charges

November 14, 2017, 6:52 pm

Practice Sheet of Right form of verbs for HSC Students

September 22, 2019, 11:40 pm

Mafia, Murder & Mayhem In The Motor City: Detroit Mob Hit Timeline (1937-2007)

December 7, 2016, 3:57 pm

The 10 Tennessee Cities With The Largest Black Population For 2021

December 21, 2020, 10:12 am

Materials Around Us Class 6 Worksheet Science Chapter 6

October 3, 2024, 5:20 am

デスクトップヒープの枯渇

January 18, 2018, 8:31 pm

Best Suvichar in Hindi |बेस्ट सुविचार |शुभ विचार हिंदी में

March 7, 2020, 11:19 pm

Kanulanu Thaake Lyrics and translation | Manam (2014)

May 9, 2014, 5:45 am

Korean Sex Porn Videos: XXX Videos & Free Porn Movies

May 30, 2025, 9:29 pm

Teen Shot In Miami Drive-By Dies From Injuries

August 8, 2011, 1:16 pm

Download: IQ Muzatasha feat Shy D & Pmj – Ulesi NiFertilizer Yamavuto

March 22, 2018, 7:23 pm

Mahakal Attitude Status

February 29, 2020, 9:52 am

Property developer set up cannabis factory to help pay off debts...

August 3, 2015, 2:29 am

♡

July 11, 2015, 6:15 am

KB: How to troubleshoot issues when adding a Hyper-V host in System Center...

August 14, 2012, 10:05 am

© 2026 //www.rssing.com