To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????U???????????UB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110101010101000010 3f3f3f3f3f3f3f3f3f3f3f553f3f3f3f3f3f3f3f3f3f3f5542
SJIS-WIN 弔?制絅?縡?醍???U弔?制絅?縡?醍???UB 1001001010100010001111111001000010100111111000110100010000111111111000110111000100111111100100011110011100111111001111110011111101010101100100101010001000111111100100001010011111100011010001000011111111100011011100010011111110010001111001110011111100111111001111110101010101000010 92a23f90a7e3443fe3713f91e73f3f3f5592a23f90a7e3443fe3713f91e73f3f3f5542
EUC-JP 弔?制絅?縡?醍???U弔?制絅?縡?醍???UB 1100010010100100001111111100000010101001111001011010010100111111111001011101001000111111110000101110100100111111001111110011111101010101110001001010010000111111110000001010100111100101101001010011111111100101110100100011111111000010111010010011111100111111001111110101010101000010 c4a43fc0a9e5a53fe5d23fc2e93f3f3f55c4a43fc0a9e5a53fe5d23fc2e93f3f3f5542
UTF-8 弔렟制絅뱌縡렕醍닺렪렗U弔렟制絅뱌縡렕醍닺렪렗UB 111001011011110010010100111010111010000010011111111001011000100010110110111001111011010110000101111010111011000110001100111001111011100010100001111010111010000010010101111010011000011010001101111010111000101110111010111010111010000010101010111010111010000010010111010101011110010110111100100101001110101110100000100111111110010110001000101101101110011110110101100001011110101110110001100011001110011110111000101000011110101110100000100101011110100110000110100011011110101110001011101110101110101110100000101010101110101110100000100101110101010101000010 e5bc94eba09fe588b6e7b585ebb18ce7b8a1eba095e9868deb8bbaeba0aaeba09755e5bc94eba09fe588b6e7b585ebb18ce7b8a1eba095e9868deb8bbaeba0aaeba0975542
UHC 弔렟制絅뱌縡렕醍닺렪렗U弔렟制絅뱌縡렕醍닺렪렗UB 1111000011000000100011101011000011110000101001001100110011100111101110011111001011101110101011011000111010101010111100001011010110110100111010001000111010111000100011101010110001010101111100001100000010001110101100001111000010100100110011001110011110111001111100101110111010101101100011101010101011110000101101011011010011101000100011101011100010001110101011000101010101000010 f0c08eb0f0a4cce7b9f2eead8eaaf0b5b4e88eb88eac55f0c08eb0f0a4cce7b9f2eead8eaaf0b5b4e88eb88eac5542

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)