To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????? 00111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f
SJIS-WIN 要わ?敖??? 10010111011101101000001011101101001111111001110111000010001111110011111100111111 977682ed3f9dc23f3f3f
EUC-JP 要わ?敖??? 11001101110101111010010011101111001111111101101011000100001111110011111100111111 cdd7a4ef3fdac43f3f3f
UTF-8 要わ스敖뱄슥寧 111010001010011010000001111000111000001010001111111011001000101010100100111001101001010110010110111010111011000110000100111011001000101010100101111011111010011010101010 e8a681e3828fec8aa4e69596ebb184ec8aa5efa6aa
UHC 要わ스敖뱄슥寧 1110100110101001101010101110111110111101101110101110011111111001101110011110111110111101101110111110011110101100 e9a9aaefbdbae7f9b9efbdbbe7ac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)