To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???Z?????????O???B^ 00111111001111110011111101011010001111110011111100111111001111110011111100111111001111110011111100111111010011110011111100111111001111110100001001011110 3f3f3f5a3f3f3f3f3f3f3f3f3f4f3f3f3f425e
SJIS-WIN 炭卒捉Z炭卒息炭卒尊炭卒捉O炭卒捉B^ 10010010010110011001000110110010100100011010100001011010100100100101100110010001101100101001000110100111100100100101100110010001101100101001000110111000100100100101100110010001101100101001000110101000010011111001001001011001100100011011001010010001101010000100001001011110 925991b291a85a925991b291a7925991b291b8925991b291a84f925991b291a8425e
EUC-JP 炭卒捉Z炭卒息炭卒尊炭卒捉O炭卒捉B^ 11000011101110101100001010110100110000101010101001011010110000111011101011000010101101001100001010101001110000111011101011000010101101001100001010111010110000111011101011000010101101001100001010101010010011111100001110111010110000101011010011000010101010100100001001011110 c3bac2b4c2aa5ac3bac2b4c2a9c3bac2b4c2bac3bac2b4c2aa4fc3bac2b4c2aa425e
UTF-8 炭卒捉Z炭卒息炭卒尊炭卒捉O炭卒捉B^ 11100111100000101010110111100101100011011001001011100110100011011000100101011010111001111000001010101101111001011000110110010010111001101000000110101111111001111000001010101101111001011000110110010010111001011011000010001010111001111000001010101101111001011000110110010010111001101000110110001001010011111110011110000010101011011110010110001101100100101110011010001101100010010100001001011110 e782ade58d92e68d895ae782ade58d92e681afe782ade58d92e5b08ae782ade58d92e68d894fe782ade58d92e68d89425e
UHC 炭卒捉Z炭卒息炭卒尊炭卒捉O炭卒捉B^ 11110111101010011111000011101111111100111011010101011010111101111010100111110000111011111110001111010011111101111010100111110000111011111111000011101110111101111010100111110000111011111111001110110101010011111111011110101001111100001110111111110011101101010100001001011110 f7a9f0eff3b55af7a9f0efe3d3f7a9f0eff0eef7a9f0eff3b54ff7a9f0eff3b5425e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)