To what bit string is a character (or string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 渦??意??鎖??碍???∽?誘??霓 1000100101010001001111110011111110001000110100110011111100111111100011011011110100111111001111111000101001010110001111110011111100111111100000011110010000111111100101110101010100111111001111111110100010111101 89513f3f88d33f3f8dbd3f3f8a563f3f3f81e43f97553f3fe8bd
EUC-JP 渦??意??鎖??碍??堉∽?誘??霓 10110001101100100011111100111111101100001101010100111111001111111011101010111111001111110011111110110011101101110011111100111111100011111011011111111101101000101110011000111111110011011011011000111111001111111111000010111111 b1b23f3fb0d53f3fbabf3f3fb3b73f3f8fb7fda2e63fcdb63f3ff0bf
UTF-8 渦깅맧意㏆㎠鎖듬겱碍⑸쵎堉∽쫩誘⑹굡霓 111001101011100010100110111010101011100110000101111010111010011110100111111001101000010010001111111000111000111110000110111000111000111010100000111010011000111010010110111010111001001110101100111010101011001010110001111001111010001010001101111000101001000110111000111011001011010110001110111001011010000010001001111000101000100010111101111011001010101110101001111010001010101010011000111000101001000110111001111010101011010110100001111010011001110010010011 e6b8a6eab985eba7a7e6848fe38f86e38ea0e98e96eb93aceab2b1e7a28de291b8ecb58ee5a089e288bdecaba9e8aa98e291b9eab5a1e99c93
UHC 渦깅맧意㏆㎠鎖듬겱碍⑸쵎堉∽쫩誘⑹굡霓 1110100010111110101100011110101110010000101100001110101111110010101001111110111110100111101100101110000111110000101101011110101110000001101111011110010011110100101010011110101110101100100100001110101110111100101000011110111110100110100000101110101110101111101010011110110010110001101101101110011111100111 e8beb1eb90b0ebf2a7efa7b2e1f0b5eb81bde4f4a9ebac90ebbca1efa682ebafa9ecb1b6e7e7

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-Win = CP932) and UNIX (EUC-JP) respectively.
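
The conversion shown in the table can be approximated with a short Python sketch. This is not the page's own implementation: the helper name to_bit_strings, the CHARSETS mapping, and the choice of Python codec names (cp932 for SJIS-WIN, cp949 for UHC) are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which is consistent with the runs of 3f in the ISO-8859-1 row above.

```python
# Illustrative sketch only; names below are hypothetical, not taken from the page.
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",   # assumption: CP932 stands in for SJIS-Win
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",        # assumption: CP949 stands in for UHC
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    # Each byte becomes exactly eight binary digits, zero-padded on the left.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    # First character of the table's UTF-8 row, used here as sample input.
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("\u6e26", codec)
        print(label, binary, hexadecimal)
```

For the sample character 渦 (U+6E26), this yields e6b8a6 in UTF-8, 8951 in CP932, and b1b2 in EUC-JP, the same leading bytes as the corresponding rows of the table.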