To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????W????Jn}????W????Jn{^ 00111111001111110011111100111111010101110011111100111111001111110011111101001010011011100111110100111111001111110011111100111111010101110011111100111111001111110011111101001010011011100111101101011110 3f3f3f3f573f3f3f3f4a6e7d3f3f3f3f573f3f3f3f4a6e7b5e
SJIS-WIN 蔗ク逵ゥW蔗ク逵ゥJn}蔗ク逵ゥW蔗ク逵ゥJn{^ 111001001111001010111000111001111001110010101001010101111110010011110010101110001110011110011100101010010100101001101110011111011110010011110010101110001110011110011100101010010101011111100100111100101011100011100111100111001010100101001010011011100111101101011110 e4f2b8e79ca957e4f2b8e79ca94a6e7de4f2b8e79ca957e4f2b8e79ca94a6e7b5e
EUC-JP 蔗ク逵ゥW蔗ク逵ゥJn}蔗ク逵ゥW蔗ク逵ゥJn{^ 1110100011110100100011101011100011101101111111001000111010101001010101111110100011110100100011101011100011101101111111001000111010101001010010100110111001111101111010001111010010001110101110001110110111111100100011101010100101010111111010001111010010001110101110001110110111111100100011101010100101001010011011100111101101011110 e8f48eb8edfc8ea957e8f48eb8edfc8ea94a6e7de8f48eb8edfc8ea957e8f48eb8edfc8ea94a6e7b5e
UTF-8 蔗ク逵ゥW蔗ク逵ゥJn}蔗ク逵ゥW蔗ク逵ゥJn{^ 111010001001010010010111111011111011110110111000111010011000000010110101111011111011110110101001010101111110100010010100100101111110111110111101101110001110100110000000101101011110111110111101101010010100101001101110011111011110100010010100100101111110111110111101101110001110100110000000101101011110111110111101101010010101011111101000100101001001011111101111101111011011100011101001100000001011010111101111101111011010100101001010011011100111101101011110 e89497efbdb8e980b5efbda957e89497efbdb8e980b5efbda94a6e7de89497efbdb8e980b5efbda957e89497efbdb8e980b5efbda94a6e7b5e
UHC 蔗?逵?W蔗?逵?Jn}蔗?逵?W蔗?逵?Jn{^ 111011011011110100111111110100001011000000111111010101111110110110111101001111111101000010110000001111110100101001101110011111011110110110111101001111111101000010110000001111110101011111101101101111010011111111010000101100000011111101001010011011100111101101011110 edbd3fd0b03f57edbd3fd0b03f4a6e7dedbd3fd0b03f57edbd3fd0b03f4a6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)