To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 悟??議??攸??悟??議?┸碎??筌 100011001110010100111111001111111000101101100011001111110011111110011101101111110011111100111111100011001110010100111111001111111000101101100011001111111000010010111101111000011110101000111111001111111110001010100011 8ce53f3f8b633f3f9dbf3f3f8ce53f3f8b633f84bde1ea3f3fe2a3
EUC-JP 悟??議??攸??悟??議?┸碎??筌 101110001110011100111111001111111011010111000100001111110011111111011010110000010011111100111111101110001110011100111111001111111011010111000100001111111010100010111111111000101110110000111111001111111110010010100101 b8e73f3fb5c43f3fdac13f3fb8e73f3fb5c43fa8bfe2ec3f3fe4a5
UTF-8 悟귣슢議곩궟攸낆졒悟귣슢議곻┸碎ㅼ졋筌 111001101000001010011111111010101011011110100011111011001000101010100010111010001010110110110000111010101011001110101001111010101011011010011111111001101001010010111000111010111000001010000110111011001010000110010010111001101000001010011111111010101011011110100011111011001000101010100010111010001010110110110000111010101011001110111011111000101001010010111000111001111010001010001110111000111000010110111100111011001010000110001011111001111010110110001100 e6829feab7a3ec8aa2e8adb0eab3a9eab69fe694b8eb8286eca192e6829feab7a3ec8aa2e8adb0eab3bbe294b8e7a28ee385bceca18be7ad8c
UHC 悟귣슢議곩궟攸낆졒悟귣슢議곻┸碎ㅼ졋筌 1110011111110110100000101110101110011010101011101110110010100001100000011110010110000010101100101110101011110010100001011110110010100000101111111110011111110110100000101110101110011010101011101110110010100001100000011110111110100110101111111110000111101111101001001110110010100000101110101110111110100111 e7f682eb9aaeeca181e582b2eaf285eca0bfe7f682eb9aaeeca181efa6bfe1efa4eca0baefa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)