To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ??杖?珥?怨?n}??杖?珥?怨?n{^ 001111110011111110001111111100010011111111100000111000000011111110001001100001010011111101101110011111010011111100111111100011111111000100111111111000001110000000111111100010011000010100111111011011100111101101011110 3f3f8ff13fe0e03f89853f6e7d3f3f8ff13fe0e03f89853f6e7b5e
EUC-JP 檉?杖?珥?怨?n}檉?杖?珥?怨?n{^ 10001111110001011011101100111111101111101111001100111111111000001110001000111111101100011110010100111111011011100111110110001111110001011011101100111111101111101111001100111111111000001110001000111111101100011110010100111111011011100111101101011110 8fc5bb3fbef33fe0e23fb1e53f6e7d8fc5bb3fbef33fe0e23fb1e53f6e7b5e
UTF-8 檉렢杖렱珥렮怨렊n}檉렢杖렱珥렮怨렊n{^ 1110011010101010100010011110101110100000101000101110011010011101100101101110101110100000101100011110011110001111101001011110101110100000101011101110011010000000101010001110101110100000100010100110111001111101111001101010101010001001111010111010000010100010111001101001110110010110111010111010000010110001111001111000111110100101111010111010000010101110111001101000000010101000111010111010000010001010011011100111101101011110 e6aa89eba0a2e69d96eba0b1e78fa5eba0aee680a8eba08a6e7de6aa89eba0a2e69d96eba0b1e78fa5eba0aee680a8eba08a6e7b5e
UHC 檉렢杖렱珥렮怨렊n}檉렢杖렱珥렮怨렊n{^ 11101111111000001000111010110011111011011110100010001110101111101110110010110100100011101011101111101010101100111000111010100001011011100111110111101111111000001000111010110011111011011110100010001110101111101110110010110100100011101011101111101010101100111000111010100001011011100111101101011110 efe08eb3ede88ebeecb48ebbeab38ea16e7defe08eb3ede88ebeecb48ebbeab38ea16e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)