What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???D??????i???D??????iB 0011111100111111001111110100010000111111001111110011111100111111001111110011111101101001001111110011111100111111010001000011111100111111001111110011111100111111001111110110100101000010 3f3f3f443f3f3f3f3f3f693f3f3f443f3f3f3f3f3f6942
SJIS-WIN ???D??????i???D??????iB 0011111100111111001111110100010000111111001111110011111100111111001111110011111101101001001111110011111100111111010001000011111100111111001111110011111100111111001111110110100101000010 3f3f3f443f3f3f3f3f3f693f3f3f443f3f3f3f3f3f6942
EUC-JP ???D??????i???D??????iB 0011111100111111001111110100010000111111001111110011111100111111001111110011111101101001001111110011111100111111010001000011111100111111001111110011111100111111001111110110100101000010 3f3f3f443f3f3f3f3f3f693f3f3f443f3f3f3f3f3f6942
UTF-8 淋귦삪D淋꾪샋淋꾩찎i淋귦삪D淋꾪샋淋꾩찎iB 1110111110100111101101011110101010110111101001101110110010000010101010100100010011101111101001111011010111101010101111101010101011101100100000111000101111101111101001111011010111101010101111101010100111101100101100001000111001101001111011111010011110110101111010101011011110100110111011001000001010101010010001001110111110100111101101011110101010111110101010101110110010000011100010111110111110100111101101011110101010111110101010011110110010110000100011100110100101000010 efa7b5eab7a6ec82aa44efa7b5eabeaaec838befa7b5eabea9ecb08e69efa7b5eab7a6ec82aa44efa7b5eabeaaec838befa7b5eabea9ecb08e6942
UHC 淋귦삪D淋꾪샋淋꾩찎i淋귦삪D淋꾪샋淋꾩찎iB 1110110011111000100000101110110110011000101010010100010011101100111110001000010011101101100110001011101011101100111110001000010011101100101010011001000001101001111011001111100010000010111011011001100010101001010001001110110011111000100001001110110110011000101110101110110011111000100001001110110010101001100100000110100101000010 ecf882ed98a944ecf884ed98baecf884eca99069ecf882ed98a944ecf884ed98baecf884eca9906942

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
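
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the tool's actual implementation. It assumes that Python's `cp932` and `cp949` codecs are close enough to SJIS-Win and UHC for illustration, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1, SJIS-WIN, and EUC-JP rows above suggest. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Map the charset labels used in the table to Python codec names.
# Assumption: cp932 ~ SJIS-Win and cp949 ~ UHC; mappings may differ in edge cases.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (bits, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_table("D").items():
        print(label, bits, hex_string)
```

For an ASCII character such as `D`, every charset listed here yields the same single byte (`44`). The rows only diverge for characters outside ASCII.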