To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????®????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111101011100011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3fae3f3f3f3f3f
SJIS-WIN ???椅ら?淫??孃る―認??兢??? 001111110011111100111111100010001101011010000010111001110011111110001000111110100011111100111111100110110110111110000010111010011000000101011100100101000100011000111111001111111001100101011101001111110011111100111111 3f3f3f88d682e73f88fa3f3f9b6f82e9815c94463f3f995d3f3f3f
EUC-JP ???椅ら?淫??孃る―認®?兢??? 0011111100111111001111111011000011011000101001001110100100111111101100001111110000111111001111111101010111010000101001001110101110100001101111011100011110100111100011111010001011101110001111111101000110111110001111110011111100111111 3f3f3fb0d8a4e93fb0fc3f3fd5d0a4eba1bdc7a78fa2ee3fd1be3f3f3f
UTF-8 麗몃쓹椅ら렟淫뉛폀孃る―認®솾兢隣⑵린 1110111110100110100010001110101110101010100000111110110010010011101110011110011010100100100001011110001110000010100010011110101110100000100111111110011010110111101010111110101110001001100110111110110110001111100000001110010110101101100000111110001110000010100010111110001010000000100101011110100010101010100011011100001010101110111011001000011010111110111001011000010110100010111011111010011110110001111000101001000110110101111010111010011010110000 efa688ebaa83ec93b9e6a485e38289eba09fe6b7abeb899bed8f80e5ad83e3828be28095e8aa8dc2aeec86bee585a2efa7b1e291b5eba6b0
UHC 麗몃쓹椅ら렟淫뉛폀孃る―認®솾兢隣⑵린 1110011010110000101110001110101110011101100101011110101111110101101010101110100110001110101100001110101111100010100001111110111110111100100011111110010110111110101010101110101110100001101010101110110011100011101000101110011110011001101100101101000011100111111011001110010010101001111010001011100010110000 e6b0b8eb9d95ebf5aae98eb0ebe287efbc8fe5beaaeba1aaece3a2e799b2d0e7ece4a9e8b8b0

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)