What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????????i???????????iB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f3f6942
SJIS-WIN ???????????i???????????iB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f3f6942
EUC-JP ???????????i???????????iB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110110100101000010 3f3f3f3f3f3f3f3f3f3f3f693f3f3f3f3f3f3f3f3f3f3f6942
UTF-8 셔샵셔섈셍렣렽샹셍렢렡i셔샵셔섈셍렣렽샹셍렢렡iB 111011001000010110010100111011001000001110110101111011001000010110010100111011001000010010001000111011001000010110001101111010111010000010100011111010111010000010111101111011001000001110111001111011001000010110001101111010111010000010100010111010111010000010100001011010011110110010000101100101001110110010000011101101011110110010000101100101001110110010000100100010001110110010000101100011011110101110100000101000111110101110100000101111011110110010000011101110011110110010000101100011011110101110100000101000101110101110100000101000010110100101000010 ec8594ec83b5ec8594ec8488ec858deba0a3eba0bdec83b9ec858deba0a2eba0a169ec8594ec83b5ec8594ec8488ec858deba0a3eba0bdec83b9ec858deba0a2eba0a16942
UHC 셔샵셔섈셍렣렽샹셍렢렡i셔샵셔섈셍렣렽샹셍렢렡iB 1011110011000101101111001010010110111100110001011011110010101010101111001100010010001110101101001000111011000101101111001010011110111100110001001000111010110011100011101011001001101001101111001100010110111100101001011011110011000101101111001010101010111100110001001000111010110100100011101100010110111100101001111011110011000100100011101011001110001110101100100110100101000010 bcc5bca5bcc5bcaabcc48eb48ec5bca7bcc48eb38eb269bcc5bca5bcc5bcaabcc48eb48ec5bca7bcc48eb38eb26942

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
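
The table above can be approximated with a short Python sketch. It is not the converter's actual implementation: the codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on), the helper names `to_bits` and `encode_table`, and the use of `errors="replace"` to produce `?` (0x3f) for characters a charset cannot represent are all assumptions chosen to mirror the output shown.

```python
# Hypothetical mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent, per the note SJIS-Win = CP932
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed: Unified Hangul Code as Windows code page 949
}


def to_bits(data: bytes) -> str:
    """Render each byte as eight binary digits, concatenated."""
    return "".join(f"{byte:08b}" for byte in data)


def encode_table(text: str, charsets=CHARSETS):
    """Return (label, displayed text, binary string, hex string) per charset."""
    rows = []
    for label, codec in charsets.items():
        # Unencodable characters become "?" (0x3f) instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Decode again to show what actually survived the round trip.
        shown = data.decode(codec, errors="replace")
        rows.append((label, shown, to_bits(data), data.hex()))
    return rows


if __name__ == "__main__":
    # "\uc154" is the Hangul syllable that opens the UTF-8 row above.
    for label, shown, bits, hexstr in encode_table("\uc154iB"):
        print(label, shown, bits, hexstr)
```

Characters outside a charset's repertoire come back as `?`, which is why the ISO-8859-1, SJIS-WIN, and EUC-JP rows above consist mostly of `3f` bytes.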