To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???竊?キ擬??孃る?維?????猷 0011111100111111001111111110001010000110001111111000001101001100100010110101101100111111001111111001101101101111100000101110100100111111100010001101101100111111001111110011111100111111001111111001011101010001 3f3f3fe2863f834c8b5b3f3f9b6f82e93f88db3f3f3f3f3f9751
EUC-JP ???竊?キ擬??孃る?維??沅??猷 00111111001111110011111111100011111001100011111110100101101011011011010110111100001111110011111111010101110100001010010011101011001111111011000011011101001111110011111110001111110001101110100100111111001111111100110110110010 3f3f3fe3e63fa5adb5bc3f3fd5d0a4eb3fb0dd3f3f8fc6e93f3fcdb2
UTF-8 捻뀁뮆竊섋キ擬듭춲孃る굟維쏁솻沅쎌슧猷 111011111010011010100100111010111000000010000001111010111010111010000110111001111010101110001010111011001000010010001011111000111000001010101101111001101001001110101100111010111001001110101101111011001011011010110010111001011010110110000011111000111000001010001011111010101011010110011111111001111011011010101101111011001000111110000001111011001000011010111011111001101011001010000101111011001000111010001100111011001000101010100111111001111000110010110111 efa6a4eb8081ebae86e7ab8aec848be382ade693aceb93adecb6b2e5ad83e3828beab59fe7b6adec8f81ec86bbe6b285ec8e8cec8aa7e78cb7
UHC 捻뀁뮆竊섋キ擬듭춲孃る굟維쏁솻沅쎌슧猷 1110011011110111101100101110110010010010100101011110111110111100100110001110100010101011101011011110101111110100101101011110110010101101100011101110010110111110101010101110101110000010100001111110101110101011100110111110011110011001101100001110101010110110101111011110110010011010101100011110101110100011 e6f7b2ec9295efbc98e8abadebf4b5ecad8ee5beaaeb8287ebab9be799b0eab6bdec9ab1eba3

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)