To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊?キ猷??嚴щ?援?┃??た吾 00111111001111110011111111100010100001100011111110000011010011001001011101010001001111110011111110011010100011101000010010001011001111111000100110000111001111111000010010101011001111110011111110000010101111011000110011100001 3f3f3fe2863f834c97513f3f9a8e848b3f89873f84ab3f3f82bd8ce1
EUC-JP ???竊?キ猷??嚴щ?援?┃洹?た吾 001111110011111100111111111000111110011000111111101001011010110111001101101100100011111100111111110100111110111010100111111010110011111110110001111001110011111110101000101011011000111111000111101110100011111110100100101111111011100011100011 3f3f3fe3e63fa5adcdb23f3fd3eea7eb3fb1e73fa8ad8fc7ba3fa4bfb8e3
UTF-8 捻뀁뮆竊섋キ猷앺뮍嚴щ엨援ㅹ┃洹섎た吾 1110111110100110101001001110101110000000100000011110101110101110100001101110011110101011100010101110110010000100100010111110001110000010101011011110011110001100101101111110110010010101101110101110101110101110100011011110010110011010101101001101000110001001111011001001011110101000111001101000111110110100111000111000010110111001111000101001010010000011111001101011010010111001111011001000010010001110111000111000000110011111111001011001000010111110 efa6a4eb8081ebae86e7ab8aec848be382ade78cb7ec95baebae8de59ab4d189ec97a8e68fb4e385b9e29483e6b4b9ec848ee3819fe590be
UHC 捻뀁뮆竊섋キ猷앺뮍嚴щ엨援ㅹ┃洹섎た吾 1110011011110111101100101110110010010010100101011110111110111100100110001110100010101011101011011110101110100011100111011110110110010010100110101110010111110001101011001110101110011110100000011110101010110101101001001110100110100110101011011110101010110111100110001110101110101010101111111110011111101110 e6f7b2ec9295efbc98e8abadeba39ded929ae5f1aceb9e81eab5a4e9a6adeab798ebaabfe7ee

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)