What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 畯???竣???池 111110110110111100111111001111110011111110001111011101100011111100111111001111111001001001110010 fb6f3f3f3f8f763f3f3f9272
EUC-JP 畯???竣???池 10001111110011011011101100111111001111110011111110111101110101110011111100111111001111111100001111010011 8fcdbb3f3f3fbdd73f3f3fc3d3
UTF-8 畯얹렰렜竣얹렰렭池 111001111001010110101111111011001001011010111001111010111010000010110000111010111010000010011100111001111010101110100011111011001001011010111001111010111010000010110000111010111010000010101101111001101011000110100000 e795afec96b9eba0b0eba09ce7aba3ec96b9eba0b0eba0ade6b1a0
UHC 畯얹렰렜竣얹렰렭池 111100011110000110111110111100011000111010111101100011101010111011110001111000101011111011110001100011101011110110001110101110101111001010101110 f1e1bef18ebd8eaef1e2bef18ebd8ebaf2ae
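A conversion like the one in the table above can be approximated in Python. This is only a minimal sketch, not the code behind this page: the helper name encode_row and the mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions, and Python's codecs may differ from this tool for some characters. Characters a charset cannot represent are replaced with "?" (0x3f), which matches the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(format(byte, "08b") for byte in raw)
    return bits, raw.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        bits, hex_string = encode_row("池", codec)
        print(label, bits, hex_string)
```

For example, encode_row("池", "utf-8") yields the hex string e6b1a0, the same three bytes that end the UTF-8 row above.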

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).