Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 巖??諶??巖??諶??巖??諶??巖??諶??B 100110111101110000111111001111111111101110101010001111110011111110011011110111000011111100111111111110111010101000111111001111111001101111011100001111110011111111111011101010100011111100111111100110111101110000111111001111111111101110101010001111110011111101000010 9bdc3f3ffbaa3f3f9bdc3f3ffbaa3f3f9bdc3f3ffbaa3f3f9bdc3f3ffbaa3f3f42
EUC-JP 巖??諶??巖??諶??巖??諶??巖??諶??B 11010110110111100011111100111111100011111101111010110101001111110011111111010110110111100011111100111111100011111101111010110101001111110011111111010110110111100011111100111111100011111101111010110101001111110011111111010110110111100011111100111111100011111101111010110101001111110011111101000010 d6de3f3f8fdeb53f3fd6de3f3f8fdeb53f3fd6de3f3f8fdeb53f3fd6de3f3f8fdeb53f3f42
UTF-8 巖땹뢜諶뙊뢖巖땹뢜諶뙊뢆巖땹뢜諶뙊뢖巖땹뢜諶뙊뢆B 11100101101101111001011011101011100101011011100111101011101000101001110011101000101010111011011011101011100110011000101011101011101000101001011011100101101101111001011011101011100101011011100111101011101000101001110011101000101010111011011011101011100110011000101011101011101000101000011011100101101101111001011011101011100101011011100111101011101000101001110011101000101010111011011011101011100110011000101011101011101000101001011011100101101101111001011011101011100101011011100111101011101000101001110011101000101010111011011011101011100110011000101011101011101000101000011001000010 e5b796eb95b9eba29ce8abb6eb998aeba296e5b796eb95b9eba29ce8abb6eb998aeba286e5b796eb95b9eba29ce8abb6eb998aeba296e5b796eb95b9eba29ce8abb6eb998aeba28642
UHC 巖땹뢜諶뙊뢖巖땹뢜諶뙊뢆巖땹뢜諶뙊뢖巖땹뢜諶뙊뢆B 11100100110111001000101110001111100011110101011111100100101001101000110010001111100011110101000111100100110111001000101110001111100011110101011111100100101001101000110010001111100011110100001011100100110111001000101110001111100011110101011111100100101001101000110010001111100011110101000111100100110111001000101110001111100011110101011111100100101001101000110010001111100011110100001001000010 e4dc8b8f8f57e4a68c8f8f51e4dc8b8f8f57e4a68c8f8f42e4dc8b8f8f57e4a68c8f8f51e4dc8b8f8f57e4a68c8f8f4242

SJIS-win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-win = CP932) and UNIX (EUC-JP).
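
A table like the one above can be approximated offline with a few lines of Python. This is only a sketch, not the code behind this page: the function name `encode_table` is hypothetical, and the mapping from the page's charset names to Python codec names (SJIS-win as `cp932`, UHC as `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the runs of `3f` seen in the ISO-8859-1 row.

```python
# Assumed mapping from the charset names shown in the table to Python codec names.
CHARSETS = [
    ("ISO-8859-1", "iso-8859-1"),
    ("SJIS-win", "cp932"),
    ("EUC-JP", "euc_jp"),
    ("UTF-8", "utf-8"),
    ("UHC", "cp949"),
]


def encode_table(text, charsets=CHARSETS):
    """Return (charset, binary string, hex string) rows for the given text."""
    rows = []
    for name, codec in charsets:
        # errors="replace" substitutes "?" for characters the charset cannot encode.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
        rows.append((name, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for name, bits, hexa in encode_table("B"):
        print(name, bits, hexa)
```

Because `B` is plain ASCII, every charset listed encodes it as the same single byte; multi-byte differences only appear for characters outside ASCII.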