To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 弔?吟?游絲??趙貊?縡??絅?鎖??粟 100100101010001000111111100010111110000100111111100111111110000011100011010011100011111100111111111001101110001011100110101110110011111111100011011100010011111100111111111000110100010000111111100011011011110100111111001111111000100010111110 92a23f8be13f9fe0e34e3f3fe6e2e6bb3fe3713f3fe3443f8dbd3f3f88be
EUC-JP 弔?吟?游絲??趙貊?縡?饔絅?鎖??粟 1100010010100100001111111011011011100011001111111101111011100010111001011010111100111111001111111110110011100100111011001011110100111111111001011101001000111111100011111110100011101111111001011010010100111111101110101011111100111111001111111011000011000000 c4a43fb6e33fdee2e5af3f3fece4ecbd3fe5d23f8fe8efe5a53fbabf3f3fb0c0
UTF-8 弔렲吟렞游絲렕렟趙貊긺縡렕饔絅뤈鎖쵌곧粟 111001011011110010010100111010111010000010110010111001011001000010011111111010111010000010011110111001101011100010111000111001111011010110110010111010111010000010010101111010111010000010011111111010001011011010011001111010001011001010001010111010101011100010111010111001111011100010100001111010111010000010010101111010011010010110010100111001111011010110000101111010111010010010001000111010011000111010010110111011001011010110001100111010101011001110100111111001111011001010011111 e5bc94eba0b2e5909feba09ee6b8b8e7b5b2eba095eba09fe8b699e8b28aeab8bae7b8a1eba095e9a594e7b585eba488e98e96ecb58ceab3a7e7b29f
UHC 弔렲吟렞游絲렕렟趙貊긺縡렕饔絅뤈鎖쵌곧粟 11110000110000001000111010111111111010111110000110001110101011111110101011111101110111101110101010001110101010101000111010110000111100001110000111011000111001111011000111100111111011101010110110001110101010101110100010111101110011001110011110001111101110001110000111110000101011001000111010110000111100001110000111011000 f0c08ebfebe18eafeafddeea8eaa8eb0f0e1d8e7b1e7eead8eaae8bdcce78fb8e1f0ac8eb0f0e1d8

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)