To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 譚台サ匁搗蛛エ譚台サ匁搗譌乗搗莉匁搗蛛エ 111001101001110110010001111001001011101110010110111001101001110110010001111001011000000110110100111001101001110110010001111001001011101110010110111001101001110110010001111001101001011110001111111001101001110110010001111001001011101110010110111001101001110110010001111001011000000110110100 e69d91e4bb96e69d91e581b4e69d91e4bb96e69d91e6978fe69d91e4bb96e69d91e581b4
EUC-JP 譚台サ匁搗蛛エ譚台サ匁搗譌乗搗莉匁搗蛛エ 11101011111111011100001011100110100011101011101111001100111010001101100111110001111010011110000110001110101101001110101111111101110000101110011010001110101110111100110011101000110110011111000111101011111101111011111011101000110110011111000111101000101111011100110011101000110110011111000111101001111000011000111010110100 ebfdc2e68ebbcce8d9f1e9e18eb4ebfdc2e68ebbcce8d9f1ebf7bee8d9f1e8bdcce8d9f1e9e18eb4
UTF-8 譚台サ匁搗蛛エ譚台サ匁搗譌乗搗莉匁搗蛛エ 111010001010110110011010111001011000111110110000111011111011110110111011111001011000110010000001111001101001000010010111111010001001101110011011111011111011110110110100111010001010110110011010111001011000111110110000111011111011110110111011111001011000110010000001111001101001000010010111111010001010110110001100111001001011100110010111111001101001000010010111111010001000111010001001111001011000110010000001111001101001000010010111111010001001101110011011111011111011110110110100 e8ad9ae58fb0efbdbbe58c81e69097e89b9befbdb4e8ad9ae58fb0efbdbbe58c81e69097e8ad8ce4b997e69097e88e89e58c81e69097e89b9befbdb4
UHC 譚台??搗蛛?譚台??搗??搗莉?搗蛛? 11010011110010011111011110111011001111110011111111010011111111011111000111001000001111111101001111001001111101111011101100111111001111111101001111111101001111110011111111010011111111011101011111101001001111111101001111111101111100011100100000111111 d3c9f7bb3f3fd3fdf1c83fd3c9f7bb3f3fd3fd3f3fd3fdd7e93fd3fdf1c83f

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)