What bit string is a character (or string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊?キ猷??嚴щ????惟????? 001111110011111100111111111000101000011000111111100000110100110010010111010100010011111100111111100110101000111010000100100010110011111100111111001111110011111110001000110100100011111100111111001111110011111100111111 3f3f3fe2863f834c97513f3f9a8e848b3f3f3f3f88d23f3f3f3f3f
EUC-JP ???竊?キ猷??嚴щ????惟??馹?? 0011111100111111001111111110001111100110001111111010010110101101110011011011001000111111001111111101001111101110101001111110101100111111001111110011111100111111101100001101010000111111001111111000111111101001101000010011111100111111 3f3f3fe3e63fa5adcdb23f3fd3eea7eb3f3f3f3fb0d43f3f8fe9a13f3f
UTF-8 捻뀁뮆竊섋キ猷앺뮍嚴щ쵄溜띷첀惟뤿뜆馹싩솾 1110111110100110101001001110101110000000100000011110101110101110100001101110011110101011100010101110110010000100100010111110001110000010101011011110011110001100101101111110110010010101101110101110101110101110100011011110010110011010101101001101000110001001111011001011010110000100111011111010011110001011111010111001110110110111111011001011001010000000111001101000001110011111111010111010010010111111111010111001110010000110111010011010011010111001111011001000101110101001111011001000011010111110 efa6a4eb8081ebae86e7ab8aec848be382ade78cb7ec95baebae8de59ab4d189ecb584efa78beb9db7ecb280e6839feba4bfeb9c86e9a6b9ec8ba9ec86be
UHC 捻뀁뮆竊섋キ猷앺뮍嚴щ쵄溜띷첀惟뤿뜆馹싩솾 111001101111011110110010111011001001001010010101111011111011110010011000111010001010101110101101111010111010001110011101111011011001001010011010111001011111000110101100111010111010110010000110111010101111111010001101111001101010101010001101111010101110111010001111111010111000110110001001111011001111000110011010111001111001100110110010 e6f7b2ec9295efbc98e8abadeba39ded929ae5f1acebac86eafe8de6aa8deaee8feb8d89ecf19ae799b2

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
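
The conversion in the table above can be approximated offline. The following is a minimal Python sketch, not the code behind this page. The helper name `encode_table` is hypothetical. The mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption, and the page's converter may differ in edge cases. Characters that a charset cannot represent are replaced with `?` (byte `3f`), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return {label: (binary bit string, hex string)} for each charset."""
    rows = {}
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for unencodable characters.
        data = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)
        rows[label] = (bits, data.hex())
    return rows


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("あ").items():
        print(label, bits, hexstr)
```

For example, the hiragana "あ" has no ISO-8859-1 representation, so that row shows the single replacement byte `3f`, while the UTF-8 row shows the three bytes `e38182`.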