To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ???漫消羈劇黔 00111111001111110011111110010110100111111000111111000001111000111011000110001100100000001110101001110111 3f3f3f969f8fc1e3b18c80ea77
EUC-JP ???漫消羈劇黔 00111111001111110011111111001100101000011011111011000011111001101011001110110111111000001111001111011000 3f3f3fcca1bec3e6b3b7e0f3d8
UTF-8 솽솅뤵漫消羈劇黔 111011001000011010111101111011001000011010000101111010111010010010110101111001101011110010101011111001101011011010001000111001111011111010001000111001011000101010000111111010011011101110010100 ec86bdec8685eba4b5e6bcabe6b688e7be88e58a87e9bb94
UHC 솽솅뤵漫消羈劇黔 10111100111000011011110011010001100011111110001111011000101111001110000110111100110100011011110011010000101111001100110010100011 bce1bcd18fe3d8bce1bcd1bcd0bccca3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)