To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ??ビ⊂??八健ダ????濯???⊂??濯 0011111100111111100000110111001010000001101111000011111100111111100101001010101010001100100100101000001101011111001111110011111100111111001111111001000111110011001111110011111100111111100000011011110000111111001111111001000111110011 3f3f837281bc3f3f94aa8c92835f3f3f3f3f91f33f3f3f81bc3f3f91f3
EUC-JP ??ビ⊂??八健ダ?薏??濯???⊂??濯 00111111001111111010010111010011101000101011111000111111001111111100100010101100101101111111001010100101110000000011111110001111110110011101111000111111001111111100001011110101001111110011111100111111101000101011111000111111001111111100001011110101 3f3fa5d3a2be3f3fc8acb7f2a5c03f8fd9de3f3fc2f53f3f3fa2be3f3fc2f5
UTF-8 룶핊ビ⊂룶웩八健ダ룫薏룶웩濯룶핊㈛⊂룶웩濯 111010111010001110110110111011011001010110001010111000111000001110010011111000101000101010000010111010111010001110110110111011001001101110101001111001011000010110101011111001011000000110100101111000111000001110000000111010111010001110101011111010001001011010001111111010111010001110110110111011001001101110101001111001101011111110101111111010111010001110110110111011011001010110001010111000111000100010011011111000101000101010000010111010111010001110110110111011001001101110101001111001101011111110101111 eba3b6ed958ae38393e28a82eba3b6ec9ba9e585abe581a5e38380eba3abe8968feba3b6ec9ba9e6bfafeba3b6ed958ae3889be28a82eba3b6ec9ba9e6bfaf
UHC 룶핊ビ⊂룶웩八健ダ룫薏룶웩濯룶핊㈛⊂룶웩濯 100011111010101111000000100011111010101111010011101000011111100010001111101010111100000010100001111110001010001011001011111011011010101111000000100011111010001011101011111110111000111110101011110000001010000111110110111110111000111110101011110000001000111110101001110011001010000111111000100011111010101111000000101000011111011011111011 8fabc08fabd3a1f88fabc0a1f8a2cbedabc08fa2ebfb8fabc0a1f6fb8fabc08fa9cca1f88fabc0a1f6fb

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)