To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ?????????淫 0011111100111111001111110011111100111111001111110011111100111111001111111000100011111010 3f3f3f3f3f3f3f3f3f88fa
EUC-JP 渶????????淫 10001111110001111110110100111111001111110011111100111111001111110011111100111111001111111011000011111100 8fc7ed3f3f3f3f3f3f3f3fb0fc
UTF-8 渶앸죳琉븀넞理띷츝淫 111001101011100010110110111011001001010110111000111011001010001110110011111011111010011110001100111010111011100010000000111010111000010010011110111011111010011110100100111010111001110110110111111011001011100010011101111001101011011110101011 e6b8b6ec95b8eca3b3efa78cebb880eb849eefa7a4eb9db7ecb89de6b7ab
UHC 渶앸죳琉븀넞理띷츝淫 1110011110110111100111011110101110100001100011101110101110100100101110101110011110000110101000101110110010110101100011011110011010101110100101101110101111100010 e7b79deba18eeba4bae786a2ecb58de6ae96ebe2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)