To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 譚台サ匁搗謳肴搗莉匁搗譚滓搗莉匁搗驕懈搗莉 1110011010011101100100011110010010111011100101101110011010011101100100011110011010010000100011011110011010011101100100011110010010111011100101101110011010011101100100011110011010011101100111111110011010011101100100011110010010111011100101101110011010011101100100011110100110000001100111001110011010011101100100011110010010111011 e69d91e4bb96e69d91e6908de69d91e4bb96e69d91e69d9fe69d91e4bb96e69d91e9819ce69d91e4bb
EUC-JP 譚台サ匁搗謳肴搗莉匁搗譚滓搗莉匁搗驕懈搗莉 111010111111110111000010111001101000111010111011110011001110100011011001111100011110101111110000101110101110100011011001111100011110100010111101110011001110100011011001111100011110101111111101110111101110100011011001111100011110100010111101110011001110100011011001111100011111000111100001110110001110100011011001111100011110100010111101 ebfdc2e68ebbcce8d9f1ebf0bae8d9f1e8bdcce8d9f1ebfddee8d9f1e8bdcce8d9f1f1e1d8e8d9f1e8bd
UTF-8 譚台サ匁搗謳肴搗莉匁搗譚滓搗莉匁搗驕懈搗莉 111010001010110110011010111001011000111110110000111011111011110110111011111001011000110010000001111001101001000010010111111010001010110010110011111010001000001010110100111001101001000010010111111010001000111010001001111001011000110010000001111001101001000010010111111010001010110110011010111001101011101110010011111001101001000010010111111010001000111010001001111001011000110010000001111001101001000010010111111010011010100110010101111001101000011110001000111001101001000010010111111010001000111010001001 e8ad9ae58fb0efbdbbe58c81e69097e8acb3e882b4e69097e88e89e58c81e69097e8ad9ae6bb93e69097e88e89e58c81e69097e9a995e68788e69097e88e89
UHC 譚台??搗謳肴搗莉?搗譚滓搗莉?搗驕懈搗莉 1101001111001001111101111011101100111111001111111101001111111101110011111100010011111101101000101101001111111101110101111110100100111111110100111111110111010011110010011110111010101011110100111111110111010111111010010011111111010011111111011100111011110110111110101010101111010011111111011101011111101001 d3c9f7bb3f3fd3fdcfc4fda2d3fdd7e93fd3fdd3c9eeabd3fdd7e93fd3fdcef6faabd3fdd7e9

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)