What bit string is a character (or a short string of characters) encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 昻??裕??蹂??碍??違?┏遺??孃り?? 11111010110100000011111100111111100101110101010000111111001111111110011011111000001111110011111110001010010101100011111100111111100010001110000100111111100001001010110010001000111000100011111100111111100110110110111110000010111010000011111100111111 fad03f3f97543f3fe6f83f3f8a563f3f88e13f84ac88e23f3f9b6f82e83f3f
EUC-JP ???裕??蹂??碍?IJ違?┏遺??孃り?? 0011111100111111001111111100110110110101001111110011111111101100111110100011111100111111101100111011011100111111100011111010100110100110101100001110001100111111101010001010111010110000111001000011111100111111110101011101000010100100111010100011111100111111 3f3f3fcdb53f3fecfa3f3fb3b73f8fa9a6b0e33fa8aeb0e43f3fd5d0a4ea3f3f
UTF-8 昻뉗떝裕꾡슭蹂좎젘碍⑸IJ違뗦┏遺얜꺊孃り였履 1110011010011000101110111110101110001001100101111110101110010110100111011110100010100011100101011110101010111110101000011110110010001010101011011110100010111001100000101110110010100010100011101110110010100000100110001110011110100010100011011110001010010001101110001100010010110010111010011000000110010101111010111001011110100110111000101001010010001111111010011000000110111010111011001001011010011100111010101011101010001010111001011010110110000011111000111000001010001010111011001001100010000000111011111010011110011111 e698bbeb8997eb969de8a395eabea1ec8aade8b982eca28eeca098e7a28de291b8c4b2e98195eb97a6e2948fe981baec969ceaba8ae5ad83e3828aec9880efa79f
UHC 昻뉗떝裕꾡슭蹂좎젘碍⑸IJ違뗦┏遺얜꺊孃り였履 1110010011101001100001111110110010001011101100111110101110101110100001001110010010111101101111101110101110110011101000001110110010100000100101001110010011110100101010011110101110101000101001101110101011011110100010111110011010100110101011101110101110110110101111101110101110000011101100011110010110111110101010101110101010111111101101001110110010101010 e4e987ec8bb3ebae84e4bdbeebb3a0eca094e4f4a9eba8a6eade8be6a6aeebb6beeb83b1e5beaaeabfb4ecaa

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
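
The conversion shown in the table can be approximated with a short Python sketch. This is not the code behind this page; it is a minimal illustration under a few assumptions: the Python codec names `cp932` and `cp949` are taken to correspond to SJIS-WIN and UHC, and characters that a charset cannot represent are replaced with `?` (byte `3f`), which matches the runs of `3f` in the table. The names `CHARSETS`, `encode_bits`, and `convert` are hypothetical helpers introduced only for this example.

```python
# Mapping from the charset labels used in the table to Python codec names.
# Assumption: cp932 stands in for SJIS-WIN and cp949 for UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_bits(text, codec):
    """Encode text with the given codec and return (binary, hex) strings.

    Characters the codec cannot represent are replaced with '?' (0x3f).
    """
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: encode_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # "A" is a single ASCII byte in all five charsets: 01000001 / 41.
    for label, (binary, hexadecimal) in convert("A").items():
        print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal column, as in the table rows.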