What bit string is a character (or a sequence of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 譚台サ匁搗驕懈搗莉匁搗蟄俶搗莉匁搗譚第搗莉 1110011010011101100100011110010010111011100101101110011010011101100100011110100110000001100111001110011010011101100100011110010010111011100101101110011010011101100100011110010110101101100110001110011010011101100100011110010010111011100101101110011010011101100100011110011010011101100100011110011010011101100100011110010010111011 e69d91e4bb96e69d91e9819ce69d91e4bb96e69d91e5ad98e69d91e4bb96e69d91e69d91e69d91e4bb
EUC-JP 譚台サ匁搗驕懈搗莉匁搗蟄俶搗莉匁搗譚第搗莉 111010111111110111000010111001101000111010111011110011001110100011011001111100011111000111100001110110001110100011011001111100011110100010111101110011001110100011011001111100011110101010101111110100001110100011011001111100011110100010111101110011001110100011011001111100011110101111111101110000101110100011011001111100011110100010111101 ebfdc2e68ebbcce8d9f1f1e1d8e8d9f1e8bdcce8d9f1eaafd0e8d9f1e8bdcce8d9f1ebfdc2e8d9f1e8bd
UTF-8 譚台サ匁搗驕懈搗莉匁搗蟄俶搗莉匁搗譚第搗莉 111010001010110110011010111001011000111110110000111011111011110110111011111001011000110010000001111001101001000010010111111010011010100110010101111001101000011110001000111001101001000010010111111010001000111010001001111001011000110010000001111001101001000010010111111010001001111110000100111001001011111110110110111001101001000010010111111010001000111010001001111001011000110010000001111001101001000010010111111010001010110110011010111001111010110010101100111001101001000010010111111010001000111010001001 e8ad9ae58fb0efbdbbe58c81e69097e9a995e68788e69097e88e89e58c81e69097e89f84e4bfb6e69097e88e89e58c81e69097e8ad9ae7acace69097e88e89
UHC 譚台??搗驕懈搗莉?搗蟄?搗莉?搗譚第搗莉 11010011110010011111011110111011001111110011111111010011111111011100111011110110111110101010101111010011111111011101011111101001001111111101001111111101111101101101111000111111110100111111110111010111111010010011111111010011111111011101001111001001111100001010111111010011111111011101011111101001 d3c9f7bb3f3fd3fdcef6faabd3fdd7e93fd3fdf6de3fd3fdd7e93fd3fdd3c9f0afd3fdd7e9

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
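
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the function name `encode_table` and the mapping `CHARSETS` are hypothetical, and the codec names are Python's standard-library equivalents, assumed to correspond to the charsets listed (`cp932` for SJIS-WIN, `cp949` for UHC). Characters that a charset cannot represent are replaced with `?` (0x3f), which is the behaviour the ISO-8859-1 row suggests.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumed equivalent of SJIS-Win (CP932)
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumed equivalent of UHC (Unified Hangul Code)
}


def encode_table(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        binary = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, binary, data.hex()))
    return rows


for label, binary, hex_string in encode_table("村"):
    print(label, binary, hex_string)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.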