What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???鴦??淹??怨??濡??B 0011111100111111001111111110100111110001001111110011111110011111101110010011111100111111100010011000010100111111001111111001010001000111001111110011111101000010 3f3f3fe9f13f3f9fb93f3f89853f3f94473f3f42
EUC-JP ???鴦??淹??怨??濡??B 0011111100111111001111111111001011110011001111110011111111011110101110110011111100111111101100011110010100111111001111111100011110101000001111110011111101000010 3f3f3ff2f33f3fdebb3f3fb1e53f3fc7a83f3f42
UTF-8 曆뤾낵鴦잙죯淹쎿톽怨쇔렅濡딀뼓B 11101111101001101000101111101011101001001011111011101011100000101011010111101001101101001010011011101100100111101001100111101100101000111010111111100110101101111011100111101100100011101011111111101101100001101011110111100110100000001010100011101100100001111001010011101011101000001000010111100110101111111010000111101011100101001000000011101011101111001001001101000010 efa68beba4beeb82b5e9b4a6ec9e99eca3afe6b7b9ec8ebfed86bde680a8ec8794eba085e6bfa1eb9480ebbc9342
UHC 曆뤾낵鴦잙죯淹쎿톽怨쇔렅濡딀뼓B 11100110101101111000111111101010101100111011110011100100111011001001111111101011101000011000101011100101111101001001101111100110101101111000111111101010101100111011110011100101100011101001111111101011101000011000101011100110100101101001101101000010 e6b78feab3bce4ec9feba18ae5f49be6b78feab3bce58e9feba18ae6969b42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
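
The conversion in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the codec names in `CHARSETS` are an assumed mapping of the table's charset labels onto Python's built-in codecs (for example `cp932` for SJIS-WIN and `cp949` for UHC), and the helper names `encode_row` and `encode_table` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which appears to be what the table shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {name: encode_row(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    for name, (bits, hex_string) in encode_table("B").items():
        print(name, bits, hex_string)
```

For the letter "B", every charset listed yields the single byte 0x42 (binary 01000010), which matches the final byte of each row in the table.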