To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蟾ス螂ェ謐芽┳譌乗錐蟾ス螂ェ謐芽セソ蟄ォ螟 1110010110110111101111011110010110100101101010101110011010001101100010011110100010000100101100011110011010010111100011111110011010010000100011011110010110110111101111011110010110100101101010101110011010001101100010011110100010111110101111111110010110101101101010111110010110100100 e5b7bde5a5aae68d89e884b1e6978fe6908de5b7bde5a5aae68d89e8bebfe5adabe5a4
EUC-JP 蟾ス螂ェ謐芽┳譌乗錐蟾ス螂ェ謐芽セソ蟄ォ螟 111010101011100110001110101111011110101010100111100011101010101011101011111011011011001011101010101010001011001111101011111101111011111011101000101111111110110111101010101110011000111010111101111010101010011110001110101010101110101111101101101100101110101010001110101111101000111010111111111010101010111110001110101010111110101010100110 eab98ebdeaa78eaaebedb2eaa8b3ebf7bee8bfedeab98ebdeaa78eaaebedb2ea8ebe8ebfeaaf8eabeaa6
UTF-8 蟾ス螂ェ謐芽┳譌乗錐蟾ス螂ェ謐芽セソ蟄ォ螟 111010001001111110111110111011111011110110111101111010001001111010000010111011111011110110101010111010001010110010010000111010001000101010111101111000101001010010110011111010001010110110001100111001001011100110010111111010011000110010010000111010001001111110111110111011111011110110111101111010001001111010000010111011111011110110101010111010001010110010010000111010001000101010111101111011111011110110111110111011111011110110111111111010001001111110000100111011111011110110101011111010001001111010011111 e89fbeefbdbde89e82efbdaae8ac90e88abde294b3e8ad8ce4b997e98c90e89fbeefbdbde89e82efbdaae8ac90e88abdefbdbeefbdbfe89f84efbdabe89e9f
UHC 蟾?螂?謐芽┳??錐蟾?螂?謐芽??蟄?螟 111000001110101000111111110101011100110000111111110110101100110111100100101101001010011010110011001111110011111111110101110111101110000011101010001111111101010111001100001111111101101011001101111001001011010000111111001111111111011011011110001111111101100110101101 e0ea3fd5cc3fdacde4b4a6b33f3ff5dee0ea3fd5cc3fdacde4b43f3ff6de3fd9ad

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)