To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ???肄??矣? 00111111001111110011111111100011111001010011111100111111111000011110000100111111 3f3f3fe3e53f3fe1e13f
EUC-JP ??Œ肄??矣? 001111110011111110001111101010011010110111100110111001110011111100111111111000101110001100111111 3f3f8fa9ade6e73f3fe2e33f
UTF-8 黎싲Œ肄믦쓩矣섑 1110111110100110100010011110110010001011101100101100010110010010111010001000001010000100111010111010111110100110111011001001001110101001111001111001111110100011111011001000010010010001 efa689ec8bb2c592e88284ebafa6ec93a9e79fa3ec8491
UHC 黎싲Œ肄믦쓩矣섑 11100110101100011001101011101011101010001010101111101100101111011001001011101000101111101011000111101011111110001001100011101101 e6b19aeba8abecbd92e8beb1ebf898ed

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)