What bit string is a character (or a short string) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
| --- | --- | --- | --- |
| ISO-8859-1 | ?????????h | 00111111001111110011111100111111001111110011111100111111001111110011111101101000 | 3f3f3f3f3f3f3f3f3f68 |
| SJIS-WIN | 暗??誼?┥???h | 10001000110000110011111100111111100010110110001000111111100001001011110000111111001111110011111101101000 | 88c33f3f8b623f84bc3f3f3f68 |
| EUC-JP | 暗??誼?┥???h | 10110000110001010011111100111111101101011100001100111111101010001011111000111111001111110011111101101000 | b0c53f3fb5c33fa8be3f3f3f68 |
| UTF-8 | 暗삳쉴誼쀯┥栒듬솤h | 11100110100110101001011111101100100000101011001111101100100010011011010011101000101010101011110011101100100000001010111111100010100101001010010111100110101000001001001011101011100100111010110011101100100001101010010001101000 | e69a97ec82b3ec89b4e8aabcec80afe294a5e6a092eb93acec86a468 |
| UHC | 暗삳쉴誼쀯┥栒듬솤h | 11100100110111101011101111101011101111011010111111101011111111101001011111101111101001101011111011100010111000111011010111101011100110011001111001101000 | e4debbebbdafebfe97efa6bee2e3b5eb999e68 |

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
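
A conversion like the one in the table can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function name `encode_table` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions, and unencodable characters are replaced with `?` (0x3f), which is consistent with the `3f` bytes shown in the table.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" (0x3f) for characters
        # that the target charset cannot represent.
        raw = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((label, binary, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hex_string in encode_table("暗h"):
        print(label, binary, hex_string)
```

For the input "暗h", the UTF-8 row comes out as `e69a9768` and the ISO-8859-1 row as `3f68`, since 暗 has no ISO-8859-1 representation.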