To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????V}???????V{^ 00111111001111110011111100111111001111110011111100111111010101100111110100111111001111110011111100111111001111110011111100111111010101100111101101011110 3f3f3f3f3f3f3f567d3f3f3f3f3f3f3f567b5e
SJIS-WIN 炭卒息綻炭卒捉V}炭卒息綻炭卒捉V{^ 100100100101100110010001101100101001000110100111100100100101110110010010010110011001000110110010100100011010100001010110011111011001001001011001100100011011001010010001101001111001001001011101100100100101100110010001101100101001000110101000010101100111101101011110 925991b291a7925d925991b291a8567d925991b291a7925d925991b291a8567b5e
EUC-JP 炭卒息綻炭卒捉V}炭卒息綻炭卒捉V{^ 110000111011101011000010101101001100001010101001110000111011111011000011101110101100001010110100110000101010101001010110011111011100001110111010110000101011010011000010101010011100001110111110110000111011101011000010101101001100001010101010010101100111101101011110 c3bac2b4c2a9c3bec3bac2b4c2aa567dc3bac2b4c2a9c3bec3bac2b4c2aa567b5e
UTF-8 炭卒息綻炭卒捉V}炭卒息綻炭卒捉V{^ 1110011110000010101011011110010110001101100100101110011010000001101011111110011110110110101110111110011110000010101011011110010110001101100100101110011010001101100010010101011001111101111001111000001010101101111001011000110110010010111001101000000110101111111001111011011010111011111001111000001010101101111001011000110110010010111001101000110110001001010101100111101101011110 e782ade58d92e681afe7b6bbe782ade58d92e68d89567de782ade58d92e681afe7b6bbe782ade58d92e68d89567b5e
UHC 炭卒息綻炭卒捉V}炭卒息綻炭卒捉V{^ 111101111010100111110000111011111110001111010011111101111010101011110111101010011111000011101111111100111011010101010110011111011111011110101001111100001110111111100011110100111111011110101010111101111010100111110000111011111111001110110101010101100111101101011110 f7a9f0efe3d3f7aaf7a9f0eff3b5567df7a9f0efe3d3f7aaf7a9f0eff3b5567b5e

SJIS-Win, EUC-JP: Classic charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
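
The rows in the table above can be reproduced offline with a short script. The following is a minimal sketch in Python, not the code behind this page: the function name `encode_row` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed to correspond to the charsets listed here. Characters that a charset cannot represent are replaced with `?` (0x3f), which is what the ISO-8859-1 row shows.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex bit string) for one charset."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    # Each byte becomes eight binary digits, most significant bit first.
    bits = "".join(f"{byte:08b}" for byte in raw)
    return raw.decode(codec), bits, raw.hex()


if __name__ == "__main__":
    sample = "炭V}"
    for label, codec in CHARSETS.items():
        shown, bits, hexed = encode_row(sample, codec)
        print(label, shown, bits, hexed)
```

Calling `encode_row` once per charset yields one table row each; the hexadecimal column is simply the same bytes written four bits per digit instead of one.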