To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 蘖?????窈?????楡??孃る???? 1001111101010000001111110011111100111111001111110011111111100010011101110011111100111111001111110011111100111111100111101011111000111111001111111001101101101111100000101110100100111111001111110011111100111111 9f503f3f3f3f3fe2773f3f3f3f3f9ebe3f3f9b6f82e93f3f3f3f
EUC-JP 蘖?????窈??沅??楡??孃る?靷?? 110111011011000100111111001111110011111100111111001111111110001111011000001111110011111110001111110001101110100100111111001111111101110011000000001111110011111111010101110100001010010011101011001111111000111111100111101111010011111100111111 ddb13f3f3f3f3fe3d83f3f8fc6e93f3fdcc03f3fd5d0a4eb3f8fe7bd3f3f
UTF-8 蘖뽮퉭栒륂슑窈띾맩沅뤶튃楡녹췀孃る뿭靷뽪릸 111010001001100010010110111010111011110110101110111011011000100110101101111001101010000010010010111010111010010110000010111011001000101010010001111001111010101010001000111010111001110110111110111010111010011110101001111001101011001010000101111010111010010010110110111011011000101010000011111001101010010110100001111010111000010110111001111011001011011110000000111001011010110110000011111000111000001010001011111010111011111110101101111010011001110110110111111010111011110110101010111010111010011010111000 e89896ebbdaeed89ade6a092eba582ec8a91e7aa88eb9dbeeba7a9e6b285eba4b6ed8a83e6a5a1eb85b9ecb780e5ad83e3828bebbfade99db7ebbdaaeba6b8
UHC 蘖뽮퉭栒륂슑窈띾맩沅뤶튃楡녹췀孃る뿭靷뽪릸 111001011110111010010110111010101011100110000101111000101110001110001111111011011001101010100000111010011010000110001101111010111001000010110001111010101011011010001111111001001011100110011001111010101111100010110011111011001010110110011100111001011011111010101010111010111001011110101101111011001110011010010110111001101001000010010110 e5ee96eab985e2e38fed9aa0e9a18deb90b1eab68fe4b999eaf8b3ecad9ce5beaaeb97adece696e69096

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)