What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 艾??誼ヨズ楡⑦?艾??誼ヨズ楡??^ 111001001000100000111111001111111000101101100010100000111000100010000011010110011001111010111110100001110100011000111111111001001000100000111111001111111000101101100010100000111000100010000011010110011001111010111110001111110011111101011110 e4883f3f8b62838883599ebe87463fe4883f3f8b62838883599ebe3f3f5e
EUC-JP 艾??誼ヨズ楡??艾??誼ヨズ楡??^ 1110011111101000001111110011111110110101110000111010010111101000101001011011101011011100110000000011111100111111111001111110100000111111001111111011010111000011101001011110100010100101101110101101110011000000001111110011111101011110 e7e83f3fb5c3a5e8a5badcc03f3fe7e83f3fb5c3a5e8a5badcc03f3f5e
UTF-8 艾싲챶誼ヨズ楡⑦돧艾싲챶誼ヨズ楡㏃쭖^ 11101000100010011011111011101100100010111011001011101100101100011011011011101000101010101011110011100011100000111010100011100011100000101011101011100110101001011010000111100010100100011010011011101011100011111010011111101000100010011011111011101100100010111011001011101100101100011011011011101000101010101011110011100011100000111010100011100011100000101011101011100110101001011010000111100011100011111000001111101100101011011001011001011110 e889beec8bb2ecb1b6e8aabce383a8e382bae6a5a1e291a6eb8fa7e889beec8bb2ecb1b6e8aabce383a8e382bae6a5a1e38f83ecad965e
UHC 艾싲챶誼ヨズ楡⑦돧艾싲챶誼ヨズ楡㏃쭖^ 11100100111101011001101011101011101010101000001111101011111111101010101111101000101010111011101011101010111110001010100011101101100010011010101111100100111101011001101011101011101010101000001111101011111111101010101111101000101010111011101011101010111110001010011111101100101001111000111001011110 e4f59aebaa83ebfeabe8abbaeaf8a8ed89abe4f59aebaa83ebfeabe8abbaeaf8a7eca78e5e

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
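
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The function name `encode_in_charsets` and the mapping from the table's labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumptions made for illustration. Characters a charset cannot represent are replaced with "?" (0x3f), which appears to match the 3f bytes in the table.

```python
# Assumed mapping from the charset labels in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_in_charsets(text):
    """Return {charset label: (binary bit string, hexadecimal string)} for text."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        # Each byte becomes eight binary digits, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows[label] = (bits, raw.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_in_charsets("^").items():
        print(label, bits, hexstr)
```

Passing a different string to `encode_in_charsets` yields one row per charset, in the same order as the table.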