To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ???肄??矣?? 0011111100111111001111111110001111100101001111110011111111100001111000010011111100111111 3f3f3fe3e53f3fe1e13f3f
EUC-JP ??Œ肄??矣?? 00111111001111111000111110101001101011011110011011100111001111110011111111100010111000110011111100111111 3f3f8fa9ade6e73f3fe2e33f3f
UTF-8 黎싲Œ肄믦쓩矣섑릹 1110111110100110100010011110110010001011101100101100010110010010111010001000001010000100111010111010111110100110111011001001001110101001111001111001111110100011111011001000010010010001111010111010011010111001 efa689ec8bb2c592e88284ebafa6ec93a9e79fa3ec8491eba6b9
UHC 黎싲Œ肄믦쓩矣섑릹 111001101011000110011010111010111010100010101011111011001011110110010010111010001011111010110001111010111111100010011000111011011001000010010111 e6b19aeba8abecbd92e8beb1ebf898ed9097

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)