To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 鳶??訝??訝??渦? 100100111100111000111111001111111110011001100010001111110011111111100110011000100011111100111111100010010101000100111111 93ce3f3fe6623f3fe6623f3f89513f
EUC-JP 鳶??訝??訝??渦? 110001101101000000111111001111111110101111000011001111110011111111101011110000110011111100111111101100011011001000111111 c6d03f3febc33f3febc33f3fb1b23f
UTF-8 鳶멩쓳訝밭궘訝배쥤渦겼 111010011011001110110110111010111010100110101001111011001001001110110011111010001010100010011101111010111011000010101101111010101011011010011000111010001010100010011101111010111011000010110000111011001010010110100100111001101011100010100110111010101011001010111100 e9b3b6eba9a9ec93b3e8a89debb0adeab698e8a89debb0b0eca5a4e6b8a6eab2bc
UHC 鳶멩쓳訝밭궘訝배쥤渦겼 11100110111010011011100011100110100111011001000111100100101110001011100111100111100000101010110111100100101110001011100111101000101000101001011011101000101111101011000011100101 e6e9b8e69d91e4b8b9e782ade4b8b9e8a296e8beb0e5

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)