To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN ????領??脚??究????領??脚??究^ 0011111100111111001111110011111110010111110011000011111100111111100010110111001000111111001111111000101110000110001111110011111100111111001111111001011111001100001111110011111110001011011100100011111100111111100010111000011001011110 3f3f3f3f97cc3f3f8b723f3f8b863f3f3f3f97cc3f3f8b723f3f8b865e
EUC-JP ????領??脚??究????領??脚??究^ 0011111100111111001111110011111111001110110011100011111100111111101101011101001100111111001111111011010111100110001111110011111100111111001111111100111011001110001111110011111110110101110100110011111100111111101101011110011001011110 3f3f3f3fcece3f3fb5d33f3fb5e63f3f3f3fcece3f3fb5d33f3fb5e65e
UTF-8 렺셔봬셔領렑렻脚렋렻究렺셔봬셔領렑렻脚렋렻究^ 11101011101000001011101011101100100001011001010011101011101101001010110011101100100001011001010011101001101000001001100011101011101000001001000111101011101000001011101111101000100001001001101011101011101000001000101111101011101000001011101111100111101010011011011011101011101000001011101011101100100001011001010011101011101101001010110011101100100001011001010011101001101000001001100011101011101000001001000111101011101000001011101111101000100001001001101011101011101000001000101111101011101000001011101111100111101010011011011001011110 eba0baec8594ebb4acec8594e9a098eba091eba0bbe8849aeba08beba0bbe7a9b6eba0baec8594ebb4acec8594e9a098eba091eba0bbe8849aeba08beba0bbe7a9b65e
UHC 렺셔봬셔領렑렻脚렋렻究렺셔봬셔領렑렻脚렋렻究^ 100011101100001010111100110001011011101011000100101111001100010111010110110001011000111010100110100011101100001111001010110001011000111010100010100011101100001111001111101111001000111011000010101111001100010110111010110001001011110011000101110101101100010110001110101001101000111011000011110010101100010110001110101000101000111011000011110011111011110001011110 8ec2bcc5bac4bcc5d6c58ea68ec3cac58ea28ec3cfbc8ec2bcc5bac4bcc5d6c58ea68ec3cac58ea28ec3cfbc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)