To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 歎息丹誰探袖歎息丹誰丹誰歎息丹誰単端B 10010010010101101001000110100111100100100100111110010010010011101001001001010100100100011011001110010010010101101001000110100111100100100100111110010010010011101001001001001111100100100100111010010010010101101001000110100111100100100100111110010010010011101001001001010000100100100101101101000010 925691a7924f924e925491b3925691a7924f924e924f924e925691a7924f924e9250925b42
EUC-JP 歎息丹誰探袖歎息丹誰丹誰歎息丹誰単端B 11000011101101111100001010101001110000111011000011000011101011111100001110110101110000101011010111000011101101111100001010101001110000111011000011000011101011111100001110110000110000111010111111000011101101111100001010101001110000111011000011000011101011111100001110110001110000111011110001000010 c3b7c2a9c3b0c3afc3b5c2b5c3b7c2a9c3b0c3afc3b0c3afc3b7c2a9c3b0c3afc3b1c3bc42
UTF-8 歎息丹誰探袖歎息丹誰丹誰歎息丹誰単端B 11100110101011011000111011100110100000011010111111100100101110001011100111101000101010101011000011100110100011101010001011101000101000101001011011100110101011011000111011100110100000011010111111100100101110001011100111101000101010101011000011100100101110001011100111101000101010101011000011100110101011011000111011100110100000011010111111100100101110001011100111101000101010101011000011100101100011011001100011100111101010111010111101000010 e6ad8ee681afe4b8b9e8aab0e68ea2e8a296e6ad8ee681afe4b8b9e8aab0e4b8b9e8aab0e6ad8ee681afe4b8b9e8aab0e58d98e7abaf42
UHC 歎息丹誰探袖歎息丹誰丹誰歎息丹誰?端B 111101111010011111100011110100111101001110100001111000101100000111110111101011101110001011000000111101111010011111100011110100111101001110100001111000101100000111010011101000011110001011000001111101111010011111100011110100111101001110100001111000101100000100111111110100111010111001000010 f7a7e3d3d3a1e2c1f7aee2c0f7a7e3d3d3a1e2c1d3a1e2c1f7a7e3d3d3a1e2c13fd3ae42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
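
The rows of the table above can be approximated offline with a short Python sketch. It is not the converter's own implementation. The codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumed to be the Python equivalents of the table's charset labels, and `errors="replace"` is assumed to match the way the tool substitutes "?" (hex 3f) for characters a charset cannot represent. The names `CHARSETS` and `encode_row` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (displayed text, binary bit string, hex string) for one charset."""
    # Unencodable characters become "?" (byte 0x3f), as in the ISO-8859-1 row.
    data = text.encode(codec, errors="replace")
    # Each byte is written as exactly eight binary digits, most significant first.
    bits = "".join(f"{byte:08b}" for byte in data)
    # Decoding the bytes again shows what survived the round trip.
    return data.decode(codec), bits, data.hex()


if __name__ == "__main__":
    for name, codec in CHARSETS.items():
        shown, bits, hexed = encode_row("端B", codec)
        print(name, shown, bits, hexed)
```

For the two-character input "端B", the UTF-8 row comes out as hex e7abaf42 and the ISO-8859-1 row as 3f42, which matches the tail of the corresponding rows in the table.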