Which bit string is a character (or short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????Lh?????????L 001111110011111100111111001111110011111100111111001111110011111100111111010011000110100000111111001111110011111100111111001111110011111100111111001111110011111101001100 3f3f3f3f3f3f3f3f3f4c683f3f3f3f3f3f3f3f3f4c
SJIS-WIN 猷?4徇?6二??Lh猷?4徇?6二??L 10010111010100010011111110000010010100111001110001101101001111111000001001010101100100111111000100111111001111110100110001101000100101110101000100111111100000100101001110011100011011010011111110000010010101011001001111110001001111110011111101001100 97513f82539c6d3f825593f13f3f4c6897513f82539c6d3f825593f13f3f4c
EUC-JP 猷?4徇?6二??Lh猷?4徇?6二??L 11001101101100100011111110100011101101001101011111001110001111111010001110110110110001101111001100111111001111110100110001101000110011011011001000111111101000111011010011010111110011100011111110100011101101101100011011110011001111110011111101001100 cdb23fa3b4d7ce3fa3b6c6f33f3f4c68cdb23fa3b4d7ce3fa3b6c6f33f3f4c
UTF-8 猷띠4徇깅6二담궙Lh猷띠4徇깅6二담궙L 111001111000110010110111111010111001110110100000111011111011110010010100111001011011111010000111111010101011100110000101111011111011110010010110111001001011101010001100111010111000101110110100111010101011011010011001010011000110100011100111100011001011011111101011100111011010000011101111101111001001010011100101101111101000011111101010101110011000010111101111101111001001011011100100101110101000110011101011100010111011010011101010101101101001100101001100 e78cb7eb9da0efbc94e5be87eab985efbc96e4ba8ceb8bb4eab6994c68e78cb7eb9da0efbc94e5be87eab985efbc96e4ba8ceb8bb4eab6994c
UHC 猷띠4徇깅6二담궙Lh猷띠4徇깅6二담궙L 111010111010001110110110111011001010001110110100111000101101111110110001111010111010001110110110111011001010001110110100111000111000001010101110010011000110100011101011101000111011011011101100101000111011010011100010110111111011000111101011101000111011011011101100101000111011010011100011100000101010111001001100 eba3b6eca3b4e2dfb1eba3b6eca3b4e382ae4c68eba3b6eca3b4e2dfb1eba3b6eca3b4e382ae4c

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
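
The kind of conversion shown in the table can be approximated with a few lines of Python. This is a minimal sketch, not the code behind this page. The helper name to_bit_strings is hypothetical, and the Python codec names (cp932 for SJIS-Win, euc_jp for EUC-JP, cp949 for UHC) are assumed to be close equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the ISO-8859-1 row.

```python
def to_bit_strings(text, charset):
    """Return (binary, hexadecimal) bit strings of text encoded in charset.

    Hypothetical helper for illustration; unencodable characters
    are replaced with "?" (byte 0x3f).
    """
    data = text.encode(charset, errors="replace")
    # Each byte becomes 8 binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


# Assumed Python codec names for the charsets in the table.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")

for charset in CHARSETS:
    binary, hexadecimal = to_bit_strings("二", charset)
    print(charset, binary, hexadecimal)
```

For the single character 二, this sketch should give e4ba8c in UTF-8, 93f1 in CP932, and c6f3 in EUC-JP, the same byte sequences that appear for that character in the rows above.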