What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 午??援ο?矣??曜??午??援ο?矣??曜??^ 1000110011011111001111110011111110001001100001111000001111001101001111111110000111100001001111110011111110010111011010100011111100111111100011001101111100111111001111111000100110000111100000111100110100111111111000011110000100111111001111111001011101101010001111110011111101011110 8cdf3f3f898783cd3fe1e13f3f976a3f3f8cdf3f3f898783cd3fe1e13f3f976a3f3f5e
EUC-JP 午??援ο?矣??曜??午??援ο?矣??曜??^ 1011100011100001001111110011111110110001111001111010011011001111001111111110001011100011001111110011111111001101110010110011111100111111101110001110000100111111001111111011000111100111101001101100111100111111111000101110001100111111001111111100110111001011001111110011111101011110 b8e13f3fb1e7a6cf3fe2e33f3fcdcb3f3fb8e13f3fb1e7a6cf3fe2e33f3fcdcb3f3f5e
UTF-8 午닿퓭援ο㎖矣섎짎曜깆찆午닿퓭援ο㎖矣섎짎曜깆찆^ 1110010110001101100010001110101110001011101111111110110110010011101011011110011010001111101101001100111010111111111000111000111010010110111001111001111110100011111011001000010010001110111011001010011110001110111001101001101110011100111010101011100110000110111011001011000010000110111001011000110110001000111010111000101110111111111011011001001110101101111001101000111110110100110011101011111111100011100011101001011011100111100111111010001111101100100001001000111011101100101001111000111011100110100110111001110011101010101110011000011011101100101100001000011001011110 e58d88eb8bbfed93ade68fb4cebfe38e96e79fa3ec848eeca78ee69b9ceab986ecb086e58d88eb8bbfed93ade68fb4cebfe38e96e79fa3ec848eeca78ee69b9ceab986ecb0865e
UHC 午닿퓭援ο㎖矣섎짎曜깆찆午닿퓭援ο㎖矣섎짎曜깆찆^ 11100111111011011011010011101010101111111001010011101010101101011010010111101111101001111010001011101011111110001001100011101011101000111001101011101000111110001011000111101100101010011000101011100111111011011011010011101010101111111001010011101010101101011010010111101111101001111010001011101011111110001001100011101011101000111001101011101000111110001011000111101100101010011000101001011110 e7edb4eabf94eab5a5efa7a2ebf898eba39ae8f8b1eca98ae7edb4eabf94eab5a5efa7a2ebf898eba39ae8f8b1eca98a5e
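The conversion shown in the table can be reproduced with a short script. The following is a minimal sketch, not the code behind this page: the `CHARSETS` mapping and the `encode_to_bits` helper are hypothetical names, and the codec names are Python's standard-library names assumed to correspond to each table row (`cp932` for SJIS-WIN, `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the rows above.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, charset):
    """Return (binary string, hex string) for text encoded in charset.

    Unencodable characters are replaced with '?' (byte 0x3f).
    """
    raw = text.encode(CHARSETS[charset], errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    # Print one line per charset for a two-character sample input.
    for name in CHARSETS:
        bits, hex_string = encode_to_bits("午^", name)
        print(name, bits, hex_string)
```

For the single character 午, this sketch gives `e58d88` in UTF-8, `8cdf` in SJIS-WIN, and `b8e1` in EUC-JP, matching the first bytes of the corresponding rows in the table.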

SJIS-Win, EUC-JP: Legacy character sets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
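For ordinary two-byte characters, the SJIS-Win and EUC-JP rows in the table are two byte layouts of the same JIS X 0208 code point, so one can be computed from the other. The sketch below illustrates the commonly described arithmetic; `eucjp_to_sjis_pair` is a hypothetical helper name, and it is assumed to apply only to two-byte JIS X 0208 characters (not half-width kana, three-byte EUC-JP sequences, or CP932 vendor extensions).

```python
def eucjp_to_sjis_pair(e1, e2):
    """Convert one two-byte EUC-JP character to its Shift_JIS byte pair.

    Assumption: (e1, e2) encodes a JIS X 0208 character, so both bytes
    are in the range 0xA1..0xFE.
    """
    # EUC-JP stores the 7-bit JIS X 0208 bytes with the high bit set.
    j1, j2 = e1 - 0x80, e2 - 0x80

    # Two JIS rows share one Shift_JIS lead byte.
    s1 = ((j1 - 0x21) >> 1) + 0x81
    if s1 > 0x9F:
        s1 += 0x40  # skip the 0xA0..0xDF range (half-width kana area)

    if j1 % 2 == 1:
        # Odd row: trail bytes start at 0x40 and skip 0x7F.
        s2 = j2 + 0x1F
        if s2 >= 0x7F:
            s2 += 1
    else:
        # Even row: trail bytes start at 0x9F.
        s2 = j2 + 0x7E
    return s1, s2


if __name__ == "__main__":
    s1, s2 = eucjp_to_sjis_pair(0xB8, 0xE1)  # EUC-JP bytes of 午 from the table
    print(f"{s1:02x}{s2:02x}")
```

Applied to the EUC-JP pairs in the table (`b8e1`, `b1e7`, `a6cf`, `e2e3`, `cdcb`), this yields the SJIS-Win pairs listed in the row above it (`8cdf`, `8987`, `83cd`, `e1e1`, `976a`).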