What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 癲?8意??猿??沃??日??唯??認?? 1110000110011111001111111000001001010111100010001101001100111111001111111000100110001110001111110011111110010111100000000011111100111111100100111111101000111111001111111001011101000010001111110011111110010100010001100011111100111111 e19f3f825788d33f3f898e3f3f97803f3f93fa3f3f97423f3f94463f3f
EUC-JP 癲?8意??猿??沃??日??唯??認?? 1110001010100001001111111010001110111000101100001101010100111111001111111011000111101110001111110011111111001101111000000011111100111111110001101111110000111111001111111100110110100011001111110011111111000111101001110011111100111111 e2a13fa3b8b0d53f3fb1ee3f3fcde03f3fc6fc3f3fcda33f3fc7a73f3f
UTF-8 癲쒕8意쎾㎤猿낆툗沃섃뫜日듿보唯몃뀪認욑쬅 111001111001100110110010111011001001001010010101111011111011110010011000111001101000010010001111111011001000111010111110111000111000111010100100111001111000110010111111111010111000001010000110111011011000100010010111111001101011001010000011111011001000010010000011111010111010101110011100111001101001011110100101111010111001001110111111111010111011001110110100111001011001010010101111111010111010101010000011111010111000000010101010111010001010101010001101111011001001101010010001111011001010110010000101 e799b2ec9295efbc98e6848fec8ebee38ea4e78cbfeb8286ed8897e6b283ec8483ebab9ce697a5eb93bfebb3b4e594afebaa83eb80aae8aa8dec9a91ecac85
UHC 癲쒕8意쎾㎤猿낆툗沃섃뫜日듿보唯몃뀪認욑쬅 111011111010011010011100111010111010001110111000111010111111001010011011111001011010011110101000111010101011101110000101111011001011100010001110111010001010101010011000111000101001000110111100111011001110110110001010111001011011101010111000111010101110011010111000111010111000010110100000111011001110001110011110111011111010011010011100 efa69ceba3b8ebf29be5a7a8eabb85ecb88ee8aa98e291bceced8ae5bab8eae6b8eb85a0ece39eefa69c

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
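
The conversion in the table above can be approximated with a short Python sketch. This is not the page's actual implementation, which is not shown here. The helper name `encode_table` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-Win, `cp949` for UHC) are assumptions, and Python's codecs may differ from the page's converter for some characters. Characters that a charset cannot represent are replaced with `?` (0x3f), which appears to be what the page does as well.

```python
# Hypothetical mapping from the page's charset labels to Python codec names.
# These are assumed to be close equivalents, not guaranteed to be identical.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return (label, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((label, bits, data.hex()))
    return rows


for label, bits, hex_string in encode_table("日"):
    print(label, bits, hex_string)
```

Each byte is written as eight binary digits, so the binary column is always four times as long as the hexadecimal column.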