To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蒻れ?逾??怨???〓???????椰 11100100111010001000001011101010001111111110011110100101001111110011111110001001100001010011111100111111001111111000000110101100001111110011111100111111001111110011111100111111001111111001111010111101 e4e882ea3fe7a53f3f89853f3f3f81ac3f3f3f3f3f3f3f9ebd
EUC-JP 蒻れ?逾??怨??璵〓?庾?????椰 1110100011101010101001001110110000111111111011101010011100111111001111111011000111100101001111110011111110001111110011001110011010100010101011100011111110001111101111001100111000111111001111110011111100111111001111111101110010111111 e8eaa4ec3feea73f3fb1e53f3f8fcce6a2ae3f8fbcce3f3f3f3f3fdcbf
UTF-8 蒻れ슜逾껅끽怨뀀뙑璵〓끃庾썼삏戮곕펶椰 111010001001001010111011111000111000001010001100111011001000101010011100111010011000000010111110111010101011101110000101111010111000000110111101111001101000000010101000111010111000000010000000111010111001100110010001111001111001001010110101111000111000000010010011111010111000000110000011111001011011101010111110111011001000110110111100111011001000001010001111111011111010011110010010111010101011001110010101111011011000111010110110111001101010010010110000 e892bbe3828cec8a9ce980beeabb85eb81bde680a8eb8080eb9991e792b5e38093eb8183e5babeec8dbcec828fefa792eab395ed8eb6e6a4b0
UHC 蒻れ슜逾껅끽怨뀀뙑璵〓끃庾썼삏戮곕펶椰 1110010110110110101010101110110010011010101010011110101110110101100000111110011010110011101000111110101010110011101100101110101110001100100101101110011010100101101000011110101110000101101110011110101011101100101111011110100010011000100101101110101110111101101100001110101110111100100001111110010110101011 e5b6aaec9aa9ebb583e6b3a3eab3b2eb8c96e6a5a1eb85b9eaecbde89896ebbdb0ebbc87e5ab

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)