What bit string is a character (or string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 異?仲???瀕??垣嘔異?仲???瀕??垣嘔^ 100010001101100100111111100100101000011100111111001111110011111110010101011011010011111100111111100010100101111110011010011100011000100011011001001111111001001010000111001111110011111100111111100101010110110100111111001111111000101001011111100110100111000101011110 88d93f92873f3f3f956d3f3f8a5f9a7188d93f92873f3f3f956d3f3f8a5f9a715e
EUC-JP 異?仲???瀕??垣嘔異?仲???瀕??垣嘔^ 101100001101101100111111110000111110011100111111001111110011111111001001110011100011111100111111101100111100000011010011110100101011000011011011001111111100001111100111001111110011111100111111110010011100111000111111001111111011001111000000110100111101001001011110 b0db3fc3e73f3f3fc9ce3f3fb3c0d3d2b0db3fc3e73f3f3fc9ce3f3fb3c0d3d25e
UTF-8 異렔仲쭹렟닻瀕罹렖垣嘔異렔仲쭹렟닻瀕罹렖垣嘔^ 11100111100101011011000011101011101000001001010011100100101110111011001011101100101011011011100111101011101000001001111111101011100010111011101111100111100000001001010111101111101001111010011011101011101000001001011011100101100111101010001111100101100110001001010011100111100101011011000011101011101000001001010011100100101110111011001011101100101011011011100111101011101000001001111111101011100010111011101111100111100000001001010111101111101001111010011011101011101000001001011011100101100111101010001111100101100110001001010001011110 e795b0eba094e4bbb2ecadb9eba09feb8bbbe78095efa7a6eba096e59ea3e59894e795b0eba094e4bbb2ecadb9eba09feb8bbbe78095efa7a6eba096e59ea3e598945e
UHC 異렔仲쭹렟닻瀕罹렖垣嘔異렔仲쭹렟닻瀕罹렖垣嘔^ 111011001011011010001110101010011111000111101010110000101110011110001110101100001011010011101001110111101011010111101100101110101000111010101011111010101010111111001111101001011110110010110110100011101010100111110001111010101100001011100111100011101011000010110100111010011101111010110101111011001011101010001110101010111110101010101111110011111010010101011110 ecb68ea9f1eac2e78eb0b4e9deb5ecba8eabeaafcfa5ecb68ea9f1eac2e78eb0b4e9deb5ecba8eabeaafcfa55e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
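
The conversion shown in the table can be approximated offline with a short Python sketch. The page does not say how it is implemented, so the mapping from the table's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) is an assumption, and the helper names `encode_row` and `convert` are hypothetical. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the ISO-8859-1 row above.

```python
# Charset labels from the table mapped to Python codec names.
# ASSUMPTION: these codecs correspond to the charsets the page uses.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary bit string, hex string) for text in the given codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("異^").items():
        print(label, bits, hex_string)
```

For example, `encode_row("異", "utf-8")` yields the hex string `e795b0`, the same three bytes that open the UTF-8 row of the table.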