Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??莊??????? 0011111100111111111001001011010100111111001111110011111100111111001111110011111100111111 3f3fe4b53f3f3f3f3f3f3f
EUC-JP 洹?莊??????? 10001111110001111011101000111111111010001011011100111111001111110011111100111111001111110011111100111111 8fc7ba3fe8b73f3f3f3f3f3f3f
UTF-8 洹렚莊렱뤯헤뀔탮₃렦 111001101011010010111001111010111010000010011010111010001000111010001010111010111010000010110001111010111010010010101111111011011001011110100100111010111000000010010100111011011000001110101110111000101000001010000011111010111010000010100110 e6b4b9eba09ae88e8aeba0b1eba4afed97a4eb8094ed83aee28283eba0a6
UHC 洹렚莊렱뤯헤뀔탮₃렦 1110101010110111100011101010110111101101111101101000111010111110100011111101110111000111111011001011001011110000101101011000111010101001111111011000111010110101 eab78eadedf68ebe8fddc7ecb2f0b58ea9fd8eb5

SJIS-Win, EUC-JP: legacy character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
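
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation: the helper name `encode_row` is hypothetical, and the codec names (`cp932` for SJIS-Win, `cp949` for UHC, `euc_jp` for EUC-JP) are Python's own and may not match the tool's conversion tables byte for byte. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the table above.

```python
def encode_row(text, charset):
    """Return (binary bit string, hex string) for text encoded in charset."""
    # errors="replace" substitutes "?" (byte 0x3f) for unencodable characters
    data = text.encode(charset, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


# Python codec names assumed to correspond to the charsets in the table
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")

if __name__ == "__main__":
    sample = "洹"  # first character of the UTF-8 row above
    for charset in CHARSETS:
        bits, hex_string = encode_row(sample, charset)
        print(charset, bits, hex_string)
```

For the UTF-8 row, this yields e6b4b9 for the first character, matching the first three bytes listed in the table.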