To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 堰?????乙??嚥〓?異??碎??沃?? 10001001100000010011111100111111001111110011111100111111100010011011001100111111001111111001101010001011100000011010110000111111100010001101100100111111001111111110000111101010001111110011111110010111100000000011111100111111 89813f3f3f3f3f89b33f3f9a8b81ac3f88d93f3fe1ea3f3f97803f3f
EUC-JP 堰?????乙??嚥〓?異??碎??沃?? 10110001111000010011111100111111001111110011111100111111101100101011010100111111001111111101001111101011101000101010111000111111101100001101101100111111001111111110001011101100001111110011111111001101111000000011111100111111 b1e13f3f3f3f3fb2b53f3fd3eba2ae3fb0db3f3fe2ec3f3fcde03f3f
UTF-8 堰묐쓷流쒒걡乙쇈럶嚥〓끃異룩첑碎ㅻ깹沃쇱슞 111001011010000010110000111010111010110010010000111011001001001110110111111011111010011110001010111011001001001010010010111010101011000110100001111001001011100110011001111011001000011110001000111010111001111110110110111001011001101010100101111000111000000010010011111010111000000110000011111001111001010110110000111010111010001110101001111011001011001010010001111001111010001010001110111000111000010110111011111010101011100110111001111001101011001010000011111011001000011110110001111011001000101010011110 e5a0b0ebac90ec93b7efa78aec9292eab1a1e4b999ec8788eb9fb6e59aa5e38093eb8183e795b0eba3a9ecb291e7a28ee385bbeab9b9e6b283ec87b1ec8a9e
UHC 堰묐쓷流쒒걡乙쇈럶嚥〓끃異룩첑碎ㅻ깹沃쇱슞 111001011110100010010001111010111001110110010100111010101111110010011100111010011000000110001010111010111110000010111100111000111000111010010101111001101011111110100001111010111000010110111001111011001011011010110111111010001010101010011110111000011110111110100100111010111011001010100001111010001010101010111100111011001001101010101010 e5e891eb9d94eafc9ce9818aebe0bce38e95e6bfa1eb85b9ecb6b7e8aa9ee1efa4ebb2a1e8aabcec9aaa

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)