To which bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 外?い節??訝??巍??外?い節??訝??巍??B 1000101001001111001111111000001010100010100100001101111100111111001111111110011001100010001111110011111110011011110110010011111100111111100010100100111100111111100000101010001010010000110111110011111100111111111001100110001000111111001111111001101111011001001111110011111101000010 8a4f3f82a290df3f3fe6623f3f9bd93f3f8a4f3f82a290df3f3fe6623f3f9bd93f3f42
EUC-JP 外?い節??訝??巍??外?い節??訝??巍??B 1011001110110000001111111010010010100100110000001110000100111111001111111110101111000011001111110011111111010110110110110011111100111111101100111011000000111111101001001010010011000000111000010011111100111111111010111100001100111111001111111101011011011011001111110011111101000010 b3b03fa4a4c0e13f3febc33f3fd6db3f3fb3b03fa4a4c0e13f3febc33f3fd6db3f3f42
UTF-8 外귟い節멩솳訝뽭뼚巍먪돍外귟い節멩솳訝뽭뼚巍먪돍B 11100101101001001001011011101010101101111001111111100011100000011000010011100111101011111000000011101011101010011010100111101100100001101011001111101000101010001001110111101011101111011010110111101011101111001001101011100101101101111000110111101011101010001010101011101011100011111000110111100101101001001001011011101010101101111001111111100011100000011000010011100111101011111000000011101011101010011010100111101100100001101011001111101000101010001001110111101011101111011010110111101011101111001001101011100101101101111000110111101011101010001010101011101011100011111000110101000010 e5a496eab79fe38184e7af80eba9a9ec86b3e8a89debbdadebbc9ae5b78deba8aaeb8f8de5a496eab79fe38184e7af80eba9a9ec86b3e8a89debbdadebbc9ae5b78deba8aaeb8f8d42
UHC 外귟い節멩솳訝뽭뼚巍먪돍外귟い節멩솳訝뽭뼚巍먪돍B 11101000111000101000001011101000101010101010010011101111101111011011100011100110100110011010100011100100101110001001011011101001100101101010000011101000111001001001000011100111100010011001101111101000111000101000001011101000101010101010010011101111101111011011100011100110100110011010100011100100101110001001011011101001100101101010000011101000111001001001000011100111100010011001101101000010 e8e282e8aaa4efbdb8e699a8e4b896e996a0e8e490e7899be8e282e8aaa4efbdb8e699a8e4b896e996a0e8e490e7899b42

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
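
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation. The function name `bit_strings` and the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) are assumptions made for illustration. Replacing unencodable characters with "?" (hex 3f) is also an assumption, inferred from the 3f bytes in the rows above.

```python
# Hypothetical mapping from the labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Encode text with the given codec and return (binary, hexadecimal) strings.

    Characters the codec cannot represent are replaced with "?" (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


if __name__ == "__main__":
    sample = "外B"  # one CJK character followed by an ASCII letter
    for label, codec in CHARSETS.items():
        binary, hexadecimal = bit_strings(sample, codec)
        print(label, binary, hexadecimal)
```

With this sketch, the letter "B" yields 01000010 (hex 42) in every charset listed, while a character such as "外" yields different byte sequences per charset and falls back to 3f in ISO-8859-1, which cannot represent it.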