Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 譚台サ匁搗陲匁搗莉匁搗謐画搗莉匁搗螟 1110011010011101100100011110010010111011100101101110011010011101100100011110100010100010100101101110011010011101100100011110010010111011100101101110011010011101100100011110011010001101100010011110011010011101100100011110010010111011100101101110011010011101100100011110010110100100 e69d91e4bb96e69d91e8a296e69d91e4bb96e69d91e68d89e69d91e4bb96e69d91e5a4
EUC-JP 譚台サ匁搗陲匁搗莉匁搗謐画搗莉匁搗螟 111010111111110111000010111001101000111010111011110011001110100011011001111100011111000010100100110011001110100011011001111100011110100010111101110011001110100011011001111100011110101111101101101100101110100011011001111100011110100010111101110011001110100011011001111100011110101010100110 ebfdc2e68ebbcce8d9f1f0a4cce8d9f1e8bdcce8d9f1ebedb2e8d9f1e8bdcce8d9f1eaa6
UTF-8 譚台サ匁搗陲匁搗莉匁搗謐画搗莉匁搗螟 111010001010110110011010111001011000111110110000111011111011110110111011111001011000110010000001111001101001000010010111111010011001100110110010111001011000110010000001111001101001000010010111111010001000111010001001111001011000110010000001111001101001000010010111111010001010110010010000111001111001010010111011111001101001000010010111111010001000111010001001111001011000110010000001111001101001000010010111111010001001111010011111 e8ad9ae58fb0efbdbbe58c81e69097e999b2e58c81e69097e88e89e58c81e69097e8ac90e794bbe69097e88e89e58c81e69097e89e9f
UHC 譚台??搗??搗莉?搗謐?搗莉?搗螟 1101001111001001111101111011101100111111001111111101001111111101001111110011111111010011111111011101011111101001001111111101001111111101110110101100110100111111110100111111110111010111111010010011111111010011111111011101100110101101 d3c9f7bb3f3fd3fd3f3fd3fdd7e93fd3fddacd3fd3fdd7e93fd3fdd9ad

SJIS-Win, EUC-JP: Classic charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
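
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the page's actual implementation. It assumes that Python's `latin-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` codecs correspond to the charset labels above, and that unencodable characters are replaced with "?" (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the codec
    # cannot represent; this is an assumption about the tool's behavior.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def encode_table(text):
    """Encode text in every charset and return {label: (binary, hex)}."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in encode_table("A").items():
        print(label, binary, hexadecimal)
```

For an ASCII character such as "A", every row comes out as 01000001 and 41, because all five charsets are ASCII-compatible. Differences appear only for non-ASCII input.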