To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????}????????????????B 00111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ????}????????????????B 00111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
EUC-JP ????}????????????????B 00111111001111110011111100111111011111010011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f7d3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
UTF-8 셔렭렼섣}셔샴셍섭셔샬셔롘렽샬셔롘렽렾렼샵B 1110110010000101100101001110101110100000101011011110101110100000101111001110110010000100101000110111110111101100100001011001010011101100100000111011010011101100100001011000110111101100100001001010110111101100100001011001010011101100100000111010110011101100100001011001010011101011101000011001100011101011101000001011110111101100100000111010110011101100100001011001010011101011101000011001100011101011101000001011110111101011101000001011111011101011101000001011110011101100100000111011010101000010 ec8594eba0adeba0bcec84a37dec8594ec83b4ec858dec84adec8594ec83acec8594eba198eba0bdec83acec8594eba198eba0bdeba0beeba0bcec83b542
UHC 셔렭렼섣}셔샴셍섭셔샬셔롘렽샬셔롘렽렾렼샵B 101111001100010110001110101110101000111011000100101111001011001001111101101111001100010110111100101001001011110011000100101111001011011110111100110001011011110010100011101111001100010110001110110111001000111011000101101111001010001110111100110001011000111011011100100011101100010110001110110001101000111011000100101111001010010101000010 bcc58eba8ec4bcb27dbcc5bca4bcc4bcb7bcc5bca3bcc58edc8ec5bca3bcc58edc8ec58ec68ec4bca542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)