To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????B 0011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??8爾???8爾?B 001111110011111110000010010101111000111010100010001111110011111100111111100000100101011110001110101000100011111101000010 3f3f82578ea23f3f3f82578ea23f42
EUC-JP ??8爾???8爾?B 001111110011111110100011101110001011110010100100001111110011111100111111101000111011100010111100101001000011111101000010 3f3fa3b8bca43f3f3fa3b8bca43f42
UTF-8 銳얜8爾춟銳얜8爾춟B 11101001100010101011001111101100100101101001110011101111101111001001100011100111100010001011111011101100101101101001111111101001100010101011001111101100100101101001110011101111101111001001100011100111100010001011111011101100101101101001111101000010 e98ab3ec969cefbc98e788beecb69fe98ab3ec969cefbc98e788beecb69f42
UHC 銳얜8爾춟銳얜8爾춟B 111001111110010110111110111010111010001110111000111011001011001110101101011110101110011111100101101111101110101110100011101110001110110010110011101011010111101001000010 e7e5beeba3b8ecb3ad7ae7e5beeba3b8ecb3ad7a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)