To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 而肯???基???良而肯???基???粱^ 1000111010100111100011010110110100111111001111110011111110001010111011100011111100111111001111111001011111000111100011101010011110001101011011010011111100111111001111111000101011101110001111110011111100111111111000101110101101011110 8ea78d6d3f3f3f8aee3f3f3f97c78ea78d6d3f3f3f8aee3f3f3fe2eb5e
EUC-JP 而肯???基???良而肯???基???粱^ 1011110010101001101110011100111000111111001111110011111110110100111100000011111100111111001111111100111011001001101111001010100110111001110011100011111100111111001111111011010011110000001111110011111100111111111001001110110101011110 bca9b9ce3f3f3fb4f03f3f3fcec9bca9b9ce3f3f3fb4f03f3f3fe4ed5e
UTF-8 而肯렫쾰렗基렱비렱良而肯렫쾰렗基렱비렱粱^ 11101000100000001000110011101000100000101010111111101011101000001010101111101100101111101011000011101011101000001001011111100101100111111011101011101011101000001011000111101011101110011000010011101011101000001011000111101000100010011010111111101000100000001000110011101000100000101010111111101011101000001010101111101100101111101011000011101011101000001001011111100101100111111011101011101011101000001011000111101011101110011000010011101011101000001011000111100111101100101011000101011110 e8808ce882afeba0abecbeb0eba097e59fbaeba0b1ebb984eba0b1e889afe8808ce882afeba0abecbeb0eba097e59fbaeba0b1ebb984eba0b1e7b2b15e
UHC 而肯렫쾰렗基렱비렱良而肯렫쾰렗基렱비렱粱^ 1110110010111011110100001110100110001110101110011100010011101011100011101010110011010000111100011000111010111110101110101111000110001110101111101101010111011110111011001011101111010000111010011000111010111001110001001110101110001110101011001101000011110001100011101011111010111010111100011000111010111110110101011101110001011110 ecbbd0e98eb9c4eb8eacd0f18ebebaf18ebed5deecbbd0e98eb9c4eb8eacd0f18ebebaf18ebed5dc5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)