To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???泣?ぜ矣?ザ艶b?誼??懿??沃?? 001111110011111100111111100010111000001100111111100000101011101011100001111000010011111110000011010101011000100110010000100000101000001000111111100010110110001000111111001111111001110011110010001111110011111110010111100000000011111100111111 3f3f3f8b833f82bae1e13f8355899082823f8b623f3f9cf23f3f97803f3f
EUC-JP ???泣?ぜ矣?ザ艶b?誼??懿??沃?? 001111110011111100111111101101011110001100111111101001001011110011100010111000110011111110100101101101101011000111110000101000111110001000111111101101011100001100111111001111111101100011110100001111110011111111001101111000000011111100111111 3f3f3fb5e33fa4bce2e33fa5b6b1f0a3e23fb5c33f3fd8f43f3fcde03f3f
UTF-8 捻뀀씮泣쒑ぜ矣몄ザ艶b뫗誼띺쪛懿몄뒧沃쇳뀽 111011111010011010100100111010111000000010000000111011001001010010101110111001101011001110100011111011001001001010010001111000111000000110011100111001111001111110100011111010111010101010000100111000111000001010110110111010001000100110110110111011111011110110000010111010111010101110010111111010001010101010111100111010111001110110111010111011001010101010011011111001101000011110111111111010111010101010000100111010111001001010100111111001101011001010000011111011001000011110110011111010111000000010111101 efa6a4eb8080ec94aee6b3a3ec9291e3819ce79fa3ebaa84e382b6e889b6efbd82ebab97e8aabceb9dbaecaa9be687bfebaa84eb92a7e6b283ec87b3eb80bd
UHC 捻뀀씮泣쒑ぜ矣몄ザ艶b뫗誼띺쪛懿몄뒧沃쇳뀽 111001101111011110110010111010111001110110111111111010111110100010011100111010001010101010111100111010111111100010111000111011001010101110110110111001101111110110100011111000101001000110111001111010111111111010001101111010011010010110010100111010111111001110111000111011001000101010100010111010001010101010111100111011011000010110110011 e6f7b2eb9dbfebe89ce8aabcebf8b8ecabb6e6fda3e291b9ebfe8de9a594ebf3b8ec8aa2e8aabced85b3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)