To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????B 001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f42
SJIS-WIN 仲?海?仲?海?B 10010010100001110011111110001010010000110011111110010010100001110011111110001010010000110011111101000010 92873f8a433f92873f8a433f42
EUC-JP 仲?海?仲?海?B 11000011111001110011111110110011101001000011111111000011111001110011111110110011101001000011111101000010 c3e73fb3a43fc3e73fb3a43f42
UTF-8 仲렫海렟仲렫海렟B 11100100101110111011001011101011101000001010101111100110101101011011011111101011101000001001111111100100101110111011001011101011101000001010101111100110101101011011011111101011101000001001111101000010 e4bbb2eba0abe6b5b7eba09fe4bbb2eba0abe6b5b7eba09f42
UHC 仲렫海렟仲렫海렟B 1111000111101010100011101011100111111010101011011000111010110000111100011110101010001110101110011111101010101101100011101011000001000010 f1ea8eb9faad8eb0f1ea8eb9faad8eb042

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)