To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???U???A???M???C 00111111001111110011111101010101001111110011111100111111010000010011111100111111001111110100110100111111001111110011111101000011 3f3f3f553f3f3f413f3f3f4d3f3f3f43
SJIS-WIN 炭卒捉U炭卒捉A炭卒捉M炭卒捉C 10010010010110011001000110110010100100011010100001010101100100100101100110010001101100101001000110101000010000011001001001011001100100011011001010010001101010000100110110010010010110011001000110110010100100011010100001000011 925991b291a855925991b291a841925991b291a84d925991b291a843
EUC-JP 炭卒捉U炭卒捉A炭卒捉M炭卒捉C 11000011101110101100001010110100110000101010101001010101110000111011101011000010101101001100001010101010010000011100001110111010110000101011010011000010101010100100110111000011101110101100001010110100110000101010101001000011 c3bac2b4c2aa55c3bac2b4c2aa41c3bac2b4c2aa4dc3bac2b4c2aa43
UTF-8 炭卒捉U炭卒捉A炭卒捉M炭卒捉C 11100111100000101010110111100101100011011001001011100110100011011000100101010101111001111000001010101101111001011000110110010010111001101000110110001001010000011110011110000010101011011110010110001101100100101110011010001101100010010100110111100111100000101010110111100101100011011001001011100110100011011000100101000011 e782ade58d92e68d8955e782ade58d92e68d8941e782ade58d92e68d894de782ade58d92e68d8943
UHC 炭卒捉U炭卒捉A炭卒捉M炭卒捉C 11110111101010011111000011101111111100111011010101010101111101111010100111110000111011111111001110110101010000011111011110101001111100001110111111110011101101010100110111110111101010011111000011101111111100111011010101000011 f7a9f0eff3b555f7a9f0eff3b541f7a9f0eff3b54df7a9f0eff3b543

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)