To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????B 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 貉ソ迺ス辷イ貉ソ譁廰貉ソ迺ス辷イ貉ソ譁廰B 111001101011100110111111111001111001001010111101111001111000100010110010111001101011100110111111111001101001011010011100010011001110011010111001101111111110011110010010101111011110011110001000101100101110011010111001101111111110011010010110100111000100110001000010 e6b9bfe792bde788b2e6b9bfe6969c4ce6b9bfe792bde788b2e6b9bfe6969c4c42
EUC-JP 貉ソ迺ス辷イ貉ソ譁廰貉ソ迺ス辷イ貉ソ譁廰B 1110110010111011100011101011111111101101111100101000111010111101111011011110100010001110101100101110110010111011100011101011111111101011111101101101011110101101111011001011101110001110101111111110110111110010100011101011110111101101111010001000111010110010111011001011101110001110101111111110101111110110110101111010110101000010 ecbb8ebfedf28ebdede88eb2ecbb8ebfebf6d7adecbb8ebfedf28ebdede88eb2ecbb8ebfebf6d7ad42
UTF-8 貉ソ迺ス辷イ貉ソ譁廰貉ソ迺ス辷イ貉ソ譁廰B 11101000101100101000100111101111101111011011111111101000101111111011101011101111101111011011110111101000101111101011011111101111101111011011001011101000101100101000100111101111101111011011111111101000101011011000000111100101101110111011000011101000101100101000100111101111101111011011111111101000101111111011101011101111101111011011110111101000101111101011011111101111101111011011001011101000101100101000100111101111101111011011111111101000101011011000000111100101101110111011000001000010 e8b289efbdbfe8bfbaefbdbde8beb7efbdb2e8b289efbdbfe8ad81e5bbb0e8b289efbdbfe8bfbaefbdbde8beb7efbdb2e8b289efbdbfe8ad81e5bbb042
UHC ????????譁?????????譁?B 0011111100111111001111110011111100111111001111110011111100111111111111001010011000111111001111110011111100111111001111110011111100111111001111110011111111111100101001100011111101000010 3f3f3f3f3f3f3f3ffca63f3f3f3f3f3f3f3f3ffca63f42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)