To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 鸚??意??蟻??猿?????B 1110101001011111001111110011111110001000110100110011111100111111100010110110000100111111001111111000100110001110001111110011111100111111001111110011111101000010 ea5f3f3f88d33f3f8b613f3f898e3f3f3f3f3f42
EUC-JP 鸚??意??蟻??猿??孼??B 11110011110000000011111100111111101100001101010100111111001111111011010111000010001111110011111110110001111011100011111100111111100011111011101011000011001111110011111101000010 f3c03f3fb0d53f3fb5c23f3fb1ee3f3f8fbac33f3f42
UTF-8 鸚쒓퍓意쎿룚蟻얍쩂猿딆뵛孼뽰왍B 11101001101110001001101011101100100100101001001111101101100011011001001111100110100001001000111111101100100011101011111111101011101000111001101011101000100111111011101111101100100101101000110111101100101010011000001011100111100011001011111111101011100101001000011011101011101101011001101111100101101011011011110011101011101111011011000011101100100110011000110101000010 e9b89aec9293ed8d93e6848fec8ebfeba39ae89fbbec968deca982e78cbfeb9486ebb59be5adbcebbdb0ec998d42
UHC 鸚쒓퍓意쎿룚蟻얍쩂猿딆뵛孼뽰왍B 11100101101001001001110011101010101110111000101011101011111100101001101111100110100011111001011011101011111111001011111011100101101001001001110011101010101110111000101011101100100101001001101111100101111011011001011011101100100111101011111001000010 e5a49ceabb8aebf29be68f96ebfcbee5a49ceabb8aec949be5ed96ec9ebe42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)