Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????W}????????W{^ 001111110011111100111111001111110011111100111111001111110011111101010111011111010011111100111111001111110011111100111111001111110011111100111111010101110111101101011110 3f3f3f3f3f3f3f3f577d3f3f3f3f3f3f3f3f577b5e
SJIS-WIN 煽?煽粟煽?煽遡W}煽?煽粟煽?煽遡W{^ 100100001111100000111111100100001111100010001000101111101001000011111000001111111001000011111000100100010110101101010111011111011001000011111000001111111001000011111000100010001011111010010000111110000011111110010000111110001001000101101011010101110111101101011110 90f83f90f888be90f83f90f8916b577d90f83f90f888be90f83f90f8916b577b5e
EUC-JP 煽?煽粟煽?煽遡W}煽?煽粟煽?煽遡W{^ 110000001111101000111111110000001111101010110000110000001100000011111010001111111100000011111010110000011100110001010111011111011100000011111010001111111100000011111010101100001100000011000000111110100011111111000000111110101100000111001100010101110111101101011110 c0fa3fc0fab0c0c0fa3fc0fac1cc577dc0fa3fc0fab0c0c0fa3fc0fac1cc577b5e
UTF-8 煽黎煽粟煽黎煽遡W}煽黎煽粟煽黎煽遡W{^ 1110011110000101101111011110111110100110100010011110011110000101101111011110011110110010100111111110011110000101101111011110111110100110100010011110011110000101101111011110100110000001101000010101011101111101111001111000010110111101111011111010011010001001111001111000010110111101111001111011001010011111111001111000010110111101111011111010011010001001111001111000010110111101111010011000000110100001010101110111101101011110 e785bdefa689e785bde7b29fe785bdefa689e785bde981a1577de785bdefa689e785bde7b29fe785bdefa689e785bde981a1577b5e
UHC 煽黎煽粟煽黎煽遡W}煽黎煽粟煽黎煽遡W{^ 11100000110000111110011010110001111000001100001111100001110110001110000011000011111001101011000111100000110000111110000111001111010101110111110111100000110000111110011010110001111000001100001111100001110110001110000011000011111001101011000111100000110000111110000111001111010101110111101101011110 e0c3e6b1e0c3e1d8e0c3e6b1e0c3e1cf577de0c3e6b1e0c3e1d8e0c3e6b1e0c3e1cf577b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
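
The conversion in the table can be reproduced offline with a minimal sketch like the one below. This is not the page's own implementation, which is not shown here. The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper name encode_row are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (byte 3f), which is consistent with the ISO-8859-1 row above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: SJIS-Win = CP932, as the note states
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC corresponds to Python's cp949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Unencodable characters become '?' instead of raising an error.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "W{^"  # the ASCII tail of the input shown in the table
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, sample, bits, hex_string)
```

For pure ASCII input such as "W{^", every charset listed here yields the same bytes (577b5e); the rows only diverge for non-ASCII characters.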