To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 而?淫???張?而?醍?而?駿???咀 100011101010011100111111100010001111101000111111001111110011111110010010101000110011111110001110101001110011111110010001111001110011111110001110101001110011111110001111011110000011111100111111001111111001100111110000 8ea73f88fa3f3f3f92a33f8ea73f91e73f8ea73f8f783f3f3f99f0
EUC-JP 而?淫???張?而?醍?而?駿???咀 101111001010100100111111101100001111110000111111001111110011111111000100101001010011111110111100101010010011111111000010111010010011111110111100101010010011111110111101110110010011111100111111001111111101001011110010 bca93fb0fc3f3f3fc4a53fbca93fc2e93fbca93fbdd93f3f3fd2f2
UTF-8 而렲淫꿱렫렲張렜而렲醍렕而렲駿계렫렲咀 111010001000000010001100111010111010000010110010111001101011011110101011111010101011111110110001111010111010000010101011111010111010000010110010111001011011110010110101111010111010000010011100111010001000000010001100111010111010000010110010111010011000011010001101111010111010000010010101111010001000000010001100111010111010000010110010111010011010011110111111111010101011001110000100111010111010000010101011111010111010000010110010111001011001001010000000 e8808ceba0b2e6b7abeabfb1eba0abeba0b2e5bcb5eba09ce8808ceba0b2e9868deba095e8808ceba0b2e9a7bfeab384eba0abeba0b2e59280
UHC 而렲淫꿱렫렲張렜而렲醍렕而렲駿계렫렲咀 1110110010111011100011101011111111101011111000101011001011101000100011101011100110001110101111111110110111100101100011101010111011101100101110111000111010111111111100001011010110001110101010101110110010111011100011101011111111110001111001111011000011101000100011101011100110001110101111111110111010111010 ecbb8ebfebe2b2e88eb98ebfede58eaeecbb8ebff0b58eaaecbb8ebff1e7b0e88eb98ebfeeba

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)