To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊?キ猷≪?鈺??維?Ⅴ貫韋??揄?? 0011111100111111001111111110001010000110001111111000001101001100100101110101000110000001111000010011111111111011110001000011111100111111100010001101101100111111100001110101100010001010110100011110100011101000001111110011111110011101100010010011111100111111 3f3f3fe2863f834c975181e13ffbc43f3f88db3f87588ad1e8e83f3f9d893f3f
EUC-JP ???竊?キ猷≪?鈺??維??貫韋??揄?? 0011111100111111001111111110001111100110001111111010010110101101110011011011001010100010111000110011111110001111111000111101010100111111001111111011000011011101001111110011111110110100110100111111000011101010001111110011111111011001111010010011111100111111 3f3f3fe3e63fa5adcdb2a2e33f8fe3d53f3fb0dd3f3fb4d3f0ea3f3fd9e93f3f
UTF-8 捻뀁뮆竊섋キ猷≪쒜鈺곗뼦維쏉Ⅴ貫韋귟맱揄몄쵃 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001011111000111000001010101101111001111000110010110111111000101000100110101010111011001001001010011100111010011000100010111010111010101011001110010111111010111011110010100110111001111011011010101101111011001000111110001001111000101000010110100100111010001011001010101011111010011001111110001011111010101011011110011111111010111010011110110001111001101000111110000100111010111010101010000100111011001011010110000011 efa6a4eb8081ebae86e7ab8aec848be382ade78cb7e289aaec929ce988baeab397ebbca6e7b6adec8f89e285a4e8b2abe99f8beab79feba7b1e68f84ebaa84ecb583
UHC 捻뀁뮆竊섋キ猷≪쒜鈺곗뼦維쏉Ⅴ貫韋귟맱揄몄쵃 1110011011110111101100101110110010010010100101011110111110111100100110001110100010101011101011011110101110100011101000011110110010111110101011101110100010101101101100001110110010010110101010011110101110101011100110111110111110100101101101001100111010111011111010101101111110000010111010001001000010111000111010101111000110111000111011001010110010000101 e6f7b2ec9295efbc98e8abadeba3a1ecbeaee8adb0ec96a9ebab9befa5b4cebbeadf82e890b8eaf1b8ecac85

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)