To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 畏??意??隱??惟れ????B 100010001101100000111111001111111000100011010011001111110011111111101000101010100011111100111111100010001101001010000010111010100011111100111111001111110011111101000010 88d83f3f88d33f3fe8aa3f3f88d282ea3f3f3f3f42
EUC-JP 畏??意??隱??惟れ????B 101100001101101000111111001111111011000011010101001111110011111111110000101011000011111100111111101100001101010010100100111011000011111100111111001111110011111101000010 b0da3f3fb0d53f3ff0ac3f3fb0d4a4ec3f3f3f3f42
UTF-8 畏븍맚意㎫빊隱들댖惟れ뵩閱곗겣B 11100111100101011000111111101011101110001000110111101011101001111001101011100110100001001000111111100011100011101010101111101011101110011000101011101001100110101011000111101011100100111010010011101011100011001001011011100110100000111001111111100011100000101000110011101011101101011010100111101001100101101011000111101010101100111001011111101010101100101010001101000010 e7958febb88deba79ae6848fe38eabebb98ae99ab1eb93a4eb8c96e6839fe3828cebb5a9e996b1eab397eab2a342
UHC 畏븍맚意㎫빊隱들댖惟れ뵩閱곗겣B 11101000111001101011101011101011100100001010101011101011111100101010011111100111100101011011000011101011110111111011010111101001100010001011101011101010111011101010101011101100100101001010011111100110111100111011000011101100100000011011010101000010 e8e6baeb90aaebf2a7e795b0ebdfb5e988baeaeeaaec94a7e6f3b0ec81b542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)