To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 橈?????飮??語??踰??矣??佯??油 1001111011110100001111110011111100111111001111110011111110011111010110100011111100111111100011001110101000111111001111111110011011111010001111110011111111100001111000010011111100111111100110001101000100111111001111111001011011111011 9ef43f3f3f3f3f9f5a3f3f8cea3f3fe6fa3f3fe1e13f3f98d13f3f96fb
EUC-JP 橈?????飮??語??踰??矣??佯??油 1101110011110110001111110011111100111111001111110011111111011101101110110011111100111111101110001110110000111111001111111110110011111100001111110011111111100010111000110011111100111111110100001101001100111111001111111100110011111101 dcf63f3f3f3f3fddbb3f3fb8ec3f3fecfc3f3fe2e33f3fd0d33f3fccfd
UTF-8 橈볦뼚杻뚦뭡飮곷겱語ⓦ꺆踰뽪첀矣곕폏佯얠뜾油 111001101010100110001000111010111011001110100110111010111011110010011010111011111010011110001000111010111001101010100110111010111010110110100001111010011010001110101110111010101011001110110111111010101011001010110001111010001010101010011110111000101001001110100110111010101011101010000110111010001011100010110000111010111011110110101010111011001011001010000000111001111001111110100011111010101011001110010101111011011000111110001111111001001011110110101111111011001001011010100000111010111001110010111110111001101011001010111001 e6a988ebb3a6ebbc9aefa788eb9aa6ebada1e9a3aeeab3b7eab2b1e8aa9ee293a6eaba86e8b8b0ebbdaaecb280e79fa3eab395ed8f8fe4bdafec96a0eb9cbee6b2b9
UHC 橈볦뼚杻뚦뭡飮곷겱語ⓦ꺆踰뽪첀矣곕폏佯얠뜾油 1110100011111010100100111110110010010110101000001110101011110100100011001110010110111001101111001110101111100110100000011110101110000001101111011110010111011110101010001110001110000011101011011110101110110010100101101110011010101010100011011110101111111000101100001110101110111100100110101110010110111010101111101110110010001101101110011110101011111010 e8fa93ec96a0eaf48ce5b9bcebe681eb81bde5dea8e383adebb296e6aa8debf8b0ebbc9ae5babeec8db9eafa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)