To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????}v????????}vB 001111110011111100111111001111110011111100111111001111110011111101111101011101100011111100111111001111110011111100111111001111110011111100111111011111010111011001000010 3f3f3f3f3f3f3f3f7d763f3f3f3f3f3f3f3f7d7642
SJIS-WIN 煽?煽粟煽?煽遡}v煽?煽粟煽?煽遡}vB 100100001111100000111111100100001111100010001000101111101001000011111000001111111001000011111000100100010110101101111101011101101001000011111000001111111001000011111000100010001011111010010000111110000011111110010000111110001001000101101011011111010111011001000010 90f83f90f888be90f83f90f8916b7d7690f83f90f888be90f83f90f8916b7d7642
EUC-JP 煽?煽粟煽?煽遡}v煽?煽粟煽?煽遡}vB 110000001111101000111111110000001111101010110000110000001100000011111010001111111100000011111010110000011100110001111101011101101100000011111010001111111100000011111010101100001100000011000000111110100011111111000000111110101100000111001100011111010111011001000010 c0fa3fc0fab0c0c0fa3fc0fac1cc7d76c0fa3fc0fab0c0c0fa3fc0fac1cc7d7642
UTF-8 煽黎煽粟煽黎煽遡}v煽黎煽粟煽黎煽遡}vB 1110011110000101101111011110111110100110100010011110011110000101101111011110011110110010100111111110011110000101101111011110111110100110100010011110011110000101101111011110100110000001101000010111110101110110111001111000010110111101111011111010011010001001111001111000010110111101111001111011001010011111111001111000010110111101111011111010011010001001111001111000010110111101111010011000000110100001011111010111011001000010 e785bdefa689e785bde7b29fe785bdefa689e785bde981a17d76e785bdefa689e785bde7b29fe785bdefa689e785bde981a17d7642
UHC 煽黎煽粟煽黎煽遡}v煽黎煽粟煽黎煽遡}vB 11100000110000111110011010110001111000001100001111100001110110001110000011000011111001101011000111100000110000111110000111001111011111010111011011100000110000111110011010110001111000001100001111100001110110001110000011000011111001101011000111100000110000111110000111001111011111010111011001000010 e0c3e6b1e0c3e1d8e0c3e6b1e0c3e1cf7d76e0c3e6b1e0c3e1d8e0c3e6b1e0c3e1cf7d7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)