To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????ら?邑??孃る?裕?????亦?? 001111110011111100111111001111111000001011100111001111111001011101010111001111110011111110011011011011111000001011101001001111111001011101010100001111110011111100111111001111110011111110010110100100100011111100111111 3f3f3f3f82e73f97573f3f9b6f82e93f97543f3f3f3f3f96923f3f
EUC-JP ????ら?邑??孃る?裕??洧??亦?? 0011111100111111001111110011111110100100111010010011111111001101101110000011111100111111110101011101000010100100111010110011111111001101101101010011111100111111100011111100011110110100001111110011111111001011111100100011111100111111 3f3f3f3fa4e93fcdb83f3fd5d0a4eb3fcdb53f3f8fc7b43f3fcbf23f3f
UTF-8 麗몃쓹隣ら렟邑㏆폀孃る쪈裕곫갭洧좊튉亦낅퇂 111011111010011010001000111010111010101010000011111011001001001110111001111011111010011110110001111000111000001010001001111010111010000010011111111010011000001010010001111000111000111110000110111011011000111110000000111001011010110110000011111000111000001010001011111011001010101010001000111010001010001110010101111010101011001110101011111010101011000010101101111001101011010010100111111011001010001010001010111011011000101010001001111001001011101010100110111010111000001010000101111011011000011110000010 efa688ebaa83ec93b9efa7b1e38289eba09fe98291e38f86ed8f80e5ad83e3828becaa88e8a395eab3abeab0ade6b4a7eca28aed8a89e4baa6eb8285ed8782
UHC 麗몃쓹隣ら렟邑㏆폀孃る쪈裕곫갭洧좊튉亦낅퇂 111001101011000010111000111010111001110110010101111011001110010010101010111010011000111010110000111010111110100110100111111011111011110010001111111001011011111010101010111010111010010110000010111010111010111010000001111001101011000010111000111010101111101110100000111010111011100110011101111001101011001010000101111010111011011110010011 e6b0b8eb9d95ece4aae98eb0ebe9a7efbc8fe5beaaeba582ebae81e6b0b8eafba0ebb99de6b285ebb793

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)