To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 艾??瀛??芯??暗??艾??瀛??芯??暗??^ 111001001000100000111111001111111110000001101001001111110011111110010000011000110011111100111111100010001100001100111111001111111110010010001000001111110011111111100000011010010011111100111111100100000110001100111111001111111000100011000011001111110011111101011110 e4883f3fe0693f3f90633f3f88c33f3fe4883f3fe0693f3f90633f3f88c33f3f5e
EUC-JP 艾??瀛??芯??暗??艾??瀛??芯??暗??^ 111001111110100000111111001111111101111111001010001111110011111110111111110001000011111100111111101100001100010100111111001111111110011111101000001111110011111111011111110010100011111100111111101111111100010000111111001111111011000011000101001111110011111101011110 e7e83f3fdfca3f3fbfc43f3fb0c53f3fe7e83f3fdfca3f3fbfc43f3fb0c53f3f5e
UTF-8 艾똿궛瀛룒뢜芯륂뢚暗땹뢏艾똿궛瀛룒뢜芯륂뢚暗땹뢏^ 11101000100010011011111011101011100110001011111111101010101101101001101111100111100000001001101111101011101000111001001011101011101000101001110011101000100010101010111111101011101001011000001011101011101000101001101011100110100110101001011111101011100101011011100111101011101000101000111111101000100010011011111011101011100110001011111111101010101101101001101111100111100000001001101111101011101000111001001011101011101000101001110011101000100010101010111111101011101001011000001011101011101000101001101011100110100110101001011111101011100101011011100111101011101000101000111101011110 e889beeb98bfeab69be7809beba392eba29ce88aafeba582eba29ae69a97eb95b9eba28fe889beeb98bfeab69be7809beba392eba29ce88aafeba582eba29ae69a97eb95b9eba28f5e
UHC 艾똿궛瀛룒뢜芯륂뢚暗땹뢏艾똿궛瀛룒뢜芯륂뢚暗땹뢏^ 11100100111101011000110010000101100000101011000011100111101110101000111110001111100011110101011111100100101001011000111111101101100011110101010111100100110111101000101110001111100011110100101011100100111101011000110010000101100000101011000011100111101110101000111110001111100011110101011111100100101001011000111111101101100011110101010111100100110111101000101110001111100011110100101001011110 e4f58c8582b0e7ba8f8f8f57e4a58fed8f55e4de8b8f8f4ae4f58c8582b0e7ba8f8f8f57e4a58fed8f55e4de8b8f8f4a5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)