What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????n}v???????n}vB 001111110011111100111111001111110011111100111111001111110110111001111101011101100011111100111111001111110011111100111111001111110011111101101110011111010111011001000010 3f3f3f3f3f3f3f6e7d763f3f3f3f3f3f3f6e7d7642
SJIS-WIN 巐ゥ巐餓ゥ被嗇n}v巐ゥ巐餓ゥ被嗇n}vB 11111010101101101010100111111010101101101000100111101100101010011001010011101101100110101010010101101110011111010111011011111010101101101010100111111010101101101000100111101100101010011001010011101101100110101010010101101110011111010111011001000010 fab6a9fab689eca994ed9aa56e7d76fab6a9fab689eca994ed9aa56e7d7642
EUC-JP 巐ゥ巐餓ゥ被嗇n}v巐ゥ巐餓ゥ被嗇n}vB 100011111011101111111001100011101010100110001111101110111111100110110010111011101000111010101001110010001110111111010100101001110110111001111101011101101000111110111011111110011000111010101001100011111011101111111001101100101110111010001110101010011100100011101111110101001010011101101110011111010111011001000010 8fbbf98ea98fbbf9b2ee8ea9c8efd4a76e7d768fbbf98ea98fbbf9b2ee8ea9c8efd4a76e7d7642
UTF-8 巐ゥ巐餓ゥ被嗇n}v巐ゥ巐餓ゥ被嗇n}vB 11100101101101111001000011101111101111011010100111100101101101111001000011101001101001001001001111101111101111011010100111101000101000101010101111100101100101111000011101101110011111010111011011100101101101111001000011101111101111011010100111100101101101111001000011101001101001001001001111101111101111011010100111101000101000101010101111100101100101111000011101101110011111010111011001000010 e5b790efbda9e5b790e9a493efbda9e8a2abe597876e7d76e5b790efbda9e5b790e9a493efbda9e8a2abe597876e7d7642
UHC ???餓?被嗇n}v???餓?被嗇n}vB 001111110011111100111111111001001011101100111111111110011010110011011111111000000110111001111101011101100011111100111111001111111110010010111011001111111111100110101100110111111110000001101110011111010111011001000010 3f3f3fe4bb3ff9acdfe06e7d763f3f3fe4bb3ff9acdfe06e7d7642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
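
The conversion shown in the table can be approximated offline. The following is a minimal sketch in Python, not the tool's actual implementation. The helper name `encode_row` and the mapping from the table's charset labels to Python codec names (`latin-1`, `cp932`, `euc_jp`, `utf-8`, `cp949`) are assumptions. The table shows 3f ("?") wherever a character has no mapping, so the sketch uses `errors="replace"`, which appears to match that behavior. Python's `euc_jp` codec may cover a slightly different repertoire than the tool's EUC-JP variant.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


if __name__ == "__main__":
    sample = "被n}vB"  # sample input chosen for illustration
    for label, codec in CHARSETS.items():
        bits, hexa = encode_row(sample, codec)
        print(label, bits, hexa)
```

The ASCII characters (n}vB) encode to the same bytes, 6e7d7642, in every charset listed, which is why that suffix is identical across all rows of the table.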