What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 煽?煽粟煽?煽賞n}煽?煽粟煽?煽賞n{^ 100100001111100000111111100100001111100010001000101111101001000011111000001111111001000011111000100011111101110001101110011111011001000011111000001111111001000011111000100010001011111010010000111110000011111110010000111110001000111111011100011011100111101101011110 90f83f90f888be90f83f90f88fdc6e7d90f83f90f888be90f83f90f88fdc6e7b5e
EUC-JP 煽?煽粟煽?煽賞n}煽?煽粟煽?煽賞n{^ 110000001111101000111111110000001111101010110000110000001100000011111010001111111100000011111010101111101101111001101110011111011100000011111010001111111100000011111010101100001100000011000000111110100011111111000000111110101011111011011110011011100111101101011110 c0fa3fc0fab0c0c0fa3fc0fabede6e7dc0fa3fc0fab0c0c0fa3fc0fabede6e7b5e
UTF-8 煽黎煽粟煽歷煽賞n}煽黎煽粟煽歷煽賞n{^ 1110011110000101101111011110111110100110100010011110011110000101101111011110011110110010100111111110011110000101101111011110111110100110100011001110011110000101101111011110100010110011100111100110111001111101111001111000010110111101111011111010011010001001111001111000010110111101111001111011001010011111111001111000010110111101111011111010011010001100111001111000010110111101111010001011001110011110011011100111101101011110 e785bdefa689e785bde7b29fe785bdefa68ce785bde8b39e6e7de785bdefa689e785bde7b29fe785bdefa68ce785bde8b39e6e7b5e
UHC 煽黎煽粟煽歷煽賞n}煽黎煽粟煽歷煽賞n{^ 11100000110000111110011010110001111000001100001111100001110110001110000011000011111001101011100011100000110000111101111111011011011011100111110111100000110000111110011010110001111000001100001111100001110110001110000011000011111001101011100011100000110000111101111111011011011011100111101101011110 e0c3e6b1e0c3e1d8e0c3e6b8e0c3dfdb6e7de0c3e6b1e0c3e1d8e0c3e6b8e0c3dfdb6e7b5e

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
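
A conversion like the one in the table can be sketched in Python. This is only an illustration, not the converter's own implementation. The mapping from the table's charset labels to Python codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) is an assumption, and the helper name encode_table is hypothetical. Characters a charset cannot represent are replaced with "?" (0x3f), as in the ISO-8859-1 row. Python's codecs may map some characters differently from the tool.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the codec lacks.
        data = text.encode(codec, errors="replace")
        # Each byte becomes eight binary digits, most significant bit first.
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


if __name__ == "__main__":
    for label, binary, hex_string in encode_table("煽n}"):
        print(label, binary, hex_string)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.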