To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 歎息丹誰丹樽歎息丹誰丹誰歎息丹誰丹樽歎息丹誰丹誰^ 10010010010101101001000110100111100100100100111110010010010011101001001001001111100100100100110110010010010101101001000110100111100100100100111110010010010011101001001001001111100100100100111010010010010101101001000110100111100100100100111110010010010011101001001001001111100100100100110110010010010101101001000110100111100100100100111110010010010011101001001001001111100100100100111001011110 925691a7924f924e924f924d925691a7924f924e924f924e925691a7924f924e924f924d925691a7924f924e924f924e5e
EUC-JP 歎息丹誰丹樽歎息丹誰丹誰歎息丹誰丹樽歎息丹誰丹誰^ 11000011101101111100001010101001110000111011000011000011101011111100001110110000110000111010111011000011101101111100001010101001110000111011000011000011101011111100001110110000110000111010111111000011101101111100001010101001110000111011000011000011101011111100001110110000110000111010111011000011101101111100001010101001110000111011000011000011101011111100001110110000110000111010111101011110 c3b7c2a9c3b0c3afc3b0c3aec3b7c2a9c3b0c3afc3b0c3afc3b7c2a9c3b0c3afc3b0c3aec3b7c2a9c3b0c3afc3b0c3af5e
UTF-8 歎息丹誰丹樽歎息丹誰丹誰歎息丹誰丹樽歎息丹誰丹誰^ 11100110101011011000111011100110100000011010111111100100101110001011100111101000101010101011000011100100101110001011100111100110101010001011110111100110101011011000111011100110100000011010111111100100101110001011100111101000101010101011000011100100101110001011100111101000101010101011000011100110101011011000111011100110100000011010111111100100101110001011100111101000101010101011000011100100101110001011100111100110101010001011110111100110101011011000111011100110100000011010111111100100101110001011100111101000101010101011000011100100101110001011100111101000101010101011000001011110 e6ad8ee681afe4b8b9e8aab0e4b8b9e6a8bde6ad8ee681afe4b8b9e8aab0e4b8b9e8aab0e6ad8ee681afe4b8b9e8aab0e4b8b9e6a8bde6ad8ee681afe4b8b9e8aab0e4b8b9e8aab05e
UHC 歎息丹誰丹樽歎息丹誰丹誰歎息丹誰丹樽歎息丹誰丹誰^ 11110111101001111110001111010011110100111010000111100010110000011101001110100001111100011101110011110111101001111110001111010011110100111010000111100010110000011101001110100001111000101100000111110111101001111110001111010011110100111010000111100010110000011101001110100001111100011101110011110111101001111110001111010011110100111010000111100010110000011101001110100001111000101100000101011110 f7a7e3d3d3a1e2c1d3a1f1dcf7a7e3d3d3a1e2c1d3a1e2c1f7a7e3d3d3a1e2c1d3a1f1dcf7a7e3d3d3a1e2c1d3a1e2c15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)