To what bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 錚敎??旭????敎??旭??? 111010000100001011111010110011010011111100111111100010001010111000111111001111110011111100111111111110101100110100111111001111111000100010101110001111110011111100111111 e842facd3f3f88ae3f3f3f3ffacd3f3f88ae3f3f3f
EUC-JP 錚???旭???????旭??? 11101111101000110011111100111111001111111011000010110000001111110011111100111111001111110011111100111111001111111011000010110000001111110011111100111111 efa33f3f3fb0b03f3f3f3f3f3f3fb0b03f3f3f
UTF-8 錚敎렟렭旭렠綎흗렣敎렟렭旭렠綎퓻 111010011000110010011010111001101001010110001110111010111010000010011111111010111010000010101101111001101001011110101101111010111010000010100000111001111011011010001110111011011001110110010111111010111010000010100011111001101001010110001110111010111010000010011111111010111010000010101101111001101001011110101101111010111010000010100000111001111011011010001110111011011001001110111011 e98c9ae6958eeba09feba0ade697adeba0a0e7b68eed9d97eba0a3e6958eeba09feba0ade697adeba0a0e7b68eed93bb
UHC 錚敎렟렭旭렠綎흗렣敎렟렭旭렠綎퓻 1110111010110110110011101110011110001110101100001000111010111010111010011110111110001110101100011110111111110010110010001110100110001110101101001100111011100111100011101011000010001110101110101110100111101111100011101011000111101111111100101100011110111111 eeb6cee78eb08ebae9ef8eb1eff2c8e98eb4cee78eb08ebae9ef8eb1eff2c7bf

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
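
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) and the helper name `encode_report` are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (byte 3f), which appears to be what the table does as well.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-Win": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_report(text):
    """Return {charset label: (binary string, hex string)} for the given text.

    Hypothetical helper: unencodable characters become '?' (0x3f).
    """
    rows = {}
    for label, codec in CHARSETS.items():
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hex_string) in encode_report("旭").items():
        print(label, bits, hex_string)
```

For the single character 旭, this sketch yields e697ad in UTF-8, 88ae in SJIS-Win, b0b0 in EUC-JP, and 3f in ISO-8859-1, which agrees with the byte sequences for that character in the table above.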