What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ??屑堯??屑跣n}??屑堯??屑跣n{^ 0011111100111111100010111111101111101010100111110011111100111111100010111111101111100110111011110110111001111101001111110011111110001011111110111110101010011111001111110011111110001011111110111110011011101111011011100111101101011110 3f3f8bfbea9f3f3f8bfbe6ef6e7d3f3f8bfbea9f3f3f8bfbe6ef6e7b5e
EUC-JP 炤?屑堯炤?屑跣n}炤?屑堯炤?屑跣n{^ 10001111110010011101001000111111101101101111110111110100101000011000111111001001110100100011111110110110111111011110110011110001011011100111110110001111110010011101001000111111101101101111110111110100101000011000111111001001110100100011111110110110111111011110110011110001011011100111101101011110 8fc9d23fb6fdf4a18fc9d23fb6fdecf16e7d8fc9d23fb6fdf4a18fc9d23fb6fdecf16e7b5e
UTF-8 炤숸屑堯炤숸屑跣n}炤숸屑堯炤숸屑跣n{^ 1110011110000010101001001110110010001000101110001110010110110001100100011110010110100000101011111110011110000010101001001110110010001000101110001110010110110001100100011110100010110111101000110110111001111101111001111000001010100100111011001000100010111000111001011011000110010001111001011010000010101111111001111000001010100100111011001000100010111000111001011011000110010001111010001011011110100011011011100111101101011110 e782a4ec88b8e5b191e5a0afe782a4ec88b8e5b191e8b7a36e7de782a4ec88b8e5b191e5a0afe782a4ec88b8e5b191e8b7a36e7b5e
UHC 炤숸屑堯炤숸屑跣n}炤숸屑堯炤숸屑跣n{^ 11100001101111111001101001001101111000001101101011101000111010111110000110111111100110100100110111100000110110101110000011010011011011100111110111100001101111111001101001001101111000001101101011101000111010111110000110111111100110100100110111100000110110101110000011010011011011100111101101011110 e1bf9a4de0dae8ebe1bf9a4de0dae0d36e7de1bf9a4de0dae8ebe1bf9a4de0dae0d36e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
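
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation. It assumes that Python's built-in codecs `cp932` and `cp949` correspond to the SJIS-WIN and UHC rows, and that a character with no mapping in a charset is replaced by `?` (byte 3f), as the table appears to do. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical, introduced only for this illustration.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # SJIS-Win = CP932, per the note above
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: UHC (Unified Hangul Code) = CP949
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f); this mimics
    the 3f bytes seen in the table and is an assumption about the tool.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "n{^" is the ASCII tail of the sample input in the table.
    for label, (bits, hexstr) in encode_table("n{^").items():
        print(label, bits, hexstr)
```

Because all five charsets are ASCII-compatible, the ASCII tail `n{^` of the sample input yields the same bytes (6e 7b 5e) in every row; the rows differ only where non-ASCII characters are involved.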