To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 猥??穩?????暗∽?節??押??曜?? 11100000110011100011111100111111111000100111001000111111001111110011111100111111001111111000100011000011100000011110010000111111100100001101111100111111001111111000100110011111001111110011111110010111011010100011111100111111 e0ce3f3fe2723f3f3f3f3f88c381e43f90df3f3f899f3f3f976a3f3f
EUC-JP 猥??穩??縕??暗∽?節??押??曜?? 111000001101000000111111001111111110001111010011001111110011111110001111110101001100001000111111001111111011000011000101101000101110011000111111110000001110000100111111001111111011001010100001001111110011111111001101110010110011111100111111 e0d03f3fe3d33f3f8fd4c23f3fb0c5a2e63fc0e13f3fb2a13f3fcdcb3f3f
UTF-8 猥듸슁穩롥뀍縕붼슫暗∽풛節녘콒押덅쾾曜닻뙯 111001111000110010100101111010111001001110111000111011001000101010000001111001111010100110101001111010111010000110100101111010111000000010001101111001111011100010010101111010111011011010111100111011001000101010101011111001101001101010010111111000101000100010111101111011011001001010011011111001111010111110000000111010111000010110011000111011001011110110010010111001101000101010111100111010111000110110000101111011001011111010111110111001101001101110011100111010111000101110111011111010111001100110101111 e78ca5eb93b8ec8a81e7a9a9eba1a5eb808de7b895ebb6bcec8aabe69a97e288bded929be7af80eb8598ecbd92e68abceb8d85ecbebee69b9ceb8bbbeb99af
UHC 猥듸슁穩롥뀍縕붼슫暗∽풛節녘콒押덅쾾曜닻뙯 111010001110010110110101111011111011110110110011111010001011000110001110111001011000010110001000111010001011001010010100111010011001101010110100111001001101111010100001111011111011111010011110111011111011110110110011111010001011000110001110111001001110001110001000111010001011001010010100111010001111100010110100111010011000110010110010 e8e5b5efbdb3e8b18ee58588e8b294e99ab4e4dea1efbe9eefbdb3e8b18ee4e388e8b294e8f8b4e98cb2

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)