To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 亦??楢??宥??娃??徇???λ?霓??喩? 10010110100100100011111100111111100100111110100000111111001111111001011101000111001111110011111110001000101000010011111100111111100111000110110100111111001111110011111110000011110010010011111111101000101111010011111100111111100110100110011100111111 96923f3f93e83f3f97473f3f88a13f3f9c6d3f3f3f83c93fe8bd3f3f9a673f
EUC-JP 亦??楢??宥??娃??徇??洹λ?霓??喩? 110010111111001000111111001111111100011011101010001111110011111111001101101010000011111100111111101100001010001100111111001111111101011111001110001111110011111110001111110001111011101010100110110010110011111111110000101111110011111100111111110100111100100000111111 cbf23f3fc6ea3f3fcda83f3fb0a33f3fd7ce3f3f8fc7baa6cb3ff0bf3f3fd3c83f
UTF-8 亦껁끇楢쇌풚宥룸퓱娃딅뱽徇꾢뭣洹λ㎛霓얠떜喩쁁 1110010010111010101001101110101010111011100000011110101110000001100001111110011010100101101000101110110010000111100011001110110110010010100110101110010110101110101001011110101110100011101110001110110110010011101100011110010110101000100000111110101110010100100001011110101110110001101111011110010110111110100001111110101010111110101000101110101110101101101000111110011010110100101110011100111010111011111000111000111010011011111010011001110010010011111011001001011010100000111010111001011010011100111001011001011010101001111011001000000110000001 e4baa6eabb81eb8187e6a5a2ec878ced929ae5aea5eba3b8ed93b1e5a883eb9485ebb1bde5be87eabea2ebada3e6b4b9cebbe38e9be99c93ec96a0eb969ce596a9ec8181
UHC 亦껁끇楢쇌풚宥룸퓱娃딅뱽徇꾢뭣洹λ㎛霓얠떜喩쁁 11100110101100101000001111100011100001011011101111101010111110011011110011100100101111101001110111101010111010011011011111101011101111111001011111101000110111111000101011101011100100111010001111100010110111111000010011100101101110011011110111101010101101111010010111101011101001111010110111100111111001111011111011101100100010111011001011101010111001111001100001000010 e6b283e385bbeaf9bce4be9deae9b7ebbf97e8df8aeb93a3e2df84e5b9bdeab7a5eba7ade7e7beec8bb2eae79842

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)