Which bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 嗚??泣??循??語??踰??矣??佯??? 1001101001101010001111110011111110001011100000110011111100111111100011110111101000111111001111111000110011101010001111110011111111100110111110100011111100111111111000011110000100111111001111111001100011010001001111110011111100111111 9a6a3f3f8b833f3f8f7a3f3f8cea3f3fe6fa3f3fe1e13f3f98d13f3f3f
EUC-JP 嗚??泣??循??語??踰??矣??佯??? 1101001111001011001111110011111110110101111000110011111100111111101111011101101100111111001111111011100011101100001111110011111111101100111111000011111100111111111000101110001100111111001111111101000011010011001111110011111100111111 d3cb3f3fb5e33f3fbddb3f3fb8ec3f3fecfc3f3fe2e33f3fd0d33f3f3f
UTF-8 嗚삠굦泣쒎ㅇ循녿겱語ⓦ꺆踰뽪첀矣곕폏佯얠뜴杻 111001011001011110011010111011001000001010100000111010101011010110100110111001101011001110100011111011001001001010001110111000111000010110000111111001011011111010101010111010111000010110111111111010101011001010110001111010001010101010011110111000101001001110100110111010101011101010000110111010001011100010110000111010111011110110101010111011001011001010000000111001111001111110100011111010101011001110010101111011011000111110001111111001001011110110101111111011001001011010100000111010111001110010110100111011111010011110001000 e5979aec82a0eab5a6e6b3a3ec928ee38587e5beaaeb85bfeab2b1e8aa9ee293a6eaba86e8b8b0ebbdaaecb280e79fa3eab395ed8f8fe4bdafec96a0eb9cb4efa788
UHC 嗚삠굦泣쒎ㅇ循녿겱語ⓦ꺆踰뽪첀矣곕폏佯얠뜴杻 1110011111110000101110111110001110000010100011001110101111101000100111001110010110100100101101111110001011100000100001101110101110000001101111011110010111011110101010001110001110000011101011011110101110110010100101101110011010101010100011011110101111111000101100001110101110111100100110101110010110111010101111101110110010001101101100101110101011110100 e7f0bbe3828cebe89ce5a4b7e2e086eb81bde5dea8e383adebb296e6aa8debf8b0ebbc9ae5babeec8db2eaf4

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
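
The conversion in the table above can be approximated with a short Python sketch. It is not the code behind this page. The mapping from the table's charset names to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, as are the helper names to_bit_strings and convert. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the 3f bytes in the table.

```python
# Assumed mapping from the charset names in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string).

    Unencodable characters become '?' (0x3f) via errors="replace".
    """
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset name: (binary, hex)} for every charset above."""
    return {name: to_bit_strings(text, codec) for name, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for name, (binary, hexadecimal) in convert("あ").items():
        print(name, binary, hexadecimal)
```

Using errors="replace" rather than the default strict mode keeps the sketch from raising an exception on characters outside a charset, at the cost of losing the original character in that row.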