What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???H???@???@|???F???AB 00111111001111110011111101001000001111110011111100111111010000000011111100111111001111110100000001111100001111110011111100111111010001100011111100111111001111110100000101000010 3f3f3f483f3f3f403f3f3f407c3f3f3f463f3f3f4142
SJIS-WIN 炭卒捉H炭卒捉@炭卒捉@|炭卒捉F炭卒捉AB 10010010010110011001000110110010100100011010100001001000100100100101100110010001101100101001000110101000010000001001001001011001100100011011001010010001101010000100000001111100100100100101100110010001101100101001000110101000010001101001001001011001100100011011001010010001101010000100000101000010 925991b291a848925991b291a840925991b291a8407c925991b291a846925991b291a84142
EUC-JP 炭卒捉H炭卒捉@炭卒捉@|炭卒捉F炭卒捉AB 11000011101110101100001010110100110000101010101001001000110000111011101011000010101101001100001010101010010000001100001110111010110000101011010011000010101010100100000001111100110000111011101011000010101101001100001010101010010001101100001110111010110000101011010011000010101010100100000101000010 c3bac2b4c2aa48c3bac2b4c2aa40c3bac2b4c2aa407cc3bac2b4c2aa46c3bac2b4c2aa4142
UTF-8 炭卒捉H炭卒捉@炭卒捉@|炭卒捉F炭卒捉AB 11100111100000101010110111100101100011011001001011100110100011011000100101001000111001111000001010101101111001011000110110010010111001101000110110001001010000001110011110000010101011011110010110001101100100101110011010001101100010010100000001111100111001111000001010101101111001011000110110010010111001101000110110001001010001101110011110000010101011011110010110001101100100101110011010001101100010010100000101000010 e782ade58d92e68d8948e782ade58d92e68d8940e782ade58d92e68d89407ce782ade58d92e68d8946e782ade58d92e68d894142
UHC 炭卒捉H炭卒捉@炭卒捉@|炭卒捉F炭卒捉AB 11110111101010011111000011101111111100111011010101001000111101111010100111110000111011111111001110110101010000001111011110101001111100001110111111110011101101010100000001111100111101111010100111110000111011111111001110110101010001101111011110101001111100001110111111110011101101010100000101000010 f7a9f0eff3b548f7a9f0eff3b540f7a9f0eff3b5407cf7a9f0eff3b546f7a9f0eff3b54142

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
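
The rows of the table can be approximated offline with a short Python sketch. This is not the converter's own code. The codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) and the helper name `encode_row` are assumptions chosen to match the table's labels, and unmappable characters are replaced with `?` (0x3f), as in the ISO-8859-1 row.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Hypothetical helper: characters the codec cannot represent are
    replaced with "?" (byte 0x3f) instead of raising an error.
    """
    data = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return bits, data.hex()


if __name__ == "__main__":
    sample = "炭H"
    for label, codec in CODECS.items():
        bits, hex_string = encode_row(sample, codec)
        print(label, bits, hex_string)
```

For example, `encode_row("炭", "utf-8")` gives the hex string `e782ad`, which matches the first three bytes of the UTF-8 row above.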