To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 猷??猷??循?+??猷??猷??循?+??B 10010111010100010011111100111111100101110101000100111111001111111000111101111010001111111000000101111011001111110011111110010111010100010011111100111111100101110101000100111111001111111000111101111010001111111000000101111011001111110011111101000010 97513f3f97513f3f8f7a3f817b3f3f97513f3f97513f3f8f7a3f817b3f3f42
EUC-JP 猷??猷??循?+??猷??猷??循?+??B 11001101101100100011111100111111110011011011001000111111001111111011110111011011001111111010000111011100001111110011111111001101101100100011111100111111110011011011001000111111001111111011110111011011001111111010000111011100001111110011111101000010 cdb23f3fcdb23f3fbddb3fa1dc3f3fcdb23f3fcdb23f3fbddb3fa1dc3f3f42
UTF-8 猷댁갯猷듯굛循뀀+李퉨猷댁갯猷듯굛循뀀+李퉨B 11100111100011001011011111101011100011001000000111101010101100001010111111100111100011001011011111101011100100111010111111101010101101011001101111100101101111101010101011101011100000001000000011101111101111001000101111101111101001111010000111101101100010011010100011100111100011001011011111101011100011001000000111101010101100001010111111100111100011001011011111101011100100111010111111101010101101011001101111100101101111101010101011101011100000001000000011101111101111001000101111101111101001111010000111101101100010011010100001000010 e78cb7eb8c81eab0afe78cb7eb93afeab59be5beaaeb8080efbc8befa7a1ed89a8e78cb7eb8c81eab0afe78cb7eb93afeab59be5beaaeb8080efbc8befa7a1ed89a842
UHC 猷댁갯猷듯굛循뀀+李퉨猷댁갯猷듯굛循뀀+李퉨B 111010111010001110110100111011001011000010111001111010111010001110110101111011011000001010000011111000101110000010110010111010111010001110101011111011001011000010111001011110101110101110100011101101001110110010110000101110011110101110100011101101011110110110000010100000111110001011100000101100101110101110100011101010111110110010110000101110010111101001000010 eba3b4ecb0b9eba3b5ed8283e2e0b2eba3abecb0b97aeba3b4ecb0b9eba3b5ed8283e2e0b2eba3abecb0b97a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)