To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 塋??衣??乙μ?[塋??衣??乙μ?[^ 1001101011001000001111110011111110001000110111110011111100111111100010011011001110000011110010100011111101011011100110101100100000111111001111111000100011011111001111110011111110001001101100111000001111001010001111110101101101011110 9ac83f3f88df3f3f89b383ca3f5b9ac83f3f88df3f3f89b383ca3f5b5e
EUC-JP 塋??衣??乙μ?[塋??衣??乙μ?[^ 1101010011001010001111110011111110110000111000010011111100111111101100101011010110100110110011000011111101011011110101001100101000111111001111111011000011100001001111110011111110110010101101011010011011001100001111110101101101011110 d4ca3f3fb0e13f3fb2b5a6cc3f5bd4ca3f3fb0e13f3fb2b5a6cc3f5b5e
UTF-8 塋좎궏衣사쳽乙μ읅[塋좎궏衣사쳽乙μ읅[^ 11100101101000011000101111101100101000101000111011101010101101101000111111101000101000011010001111101100100000101010110011101100101100111011110111100100101110011001100111001110101111001110110010011101100001010101101111100101101000011000101111101100101000101000111011101010101101101000111111101000101000011010001111101100100000101010110011101100101100111011110111100100101110011001100111001110101111001110110010011101100001010101101101011110 e5a18beca28eeab68fe8a1a3ec82acecb3bde4b999cebcec9d855be5a18beca28eeab68fe8a1a3ec82acecb3bde4b999cebcec9d855b5e
UHC 塋좎궏衣사쳽乙μ읅[塋좎궏衣사쳽乙μ읅[^ 111001111010101110100000111011001000001010100101111010111111110110111011111001111010101110100000111010111110000010100101111011001001111110111011010110111110011110101011101000001110110010000010101001011110101111111101101110111110011110101011101000001110101111100000101001011110110010011111101110110101101101011110 e7aba0ec82a5ebfdbbe7aba0ebe0a5ec9fbb5be7aba0ec82a5ebfdbbe7aba0ebe0a5ec9fbb5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)