What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????^ 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 而?媛篩??張?粧?而?媛篩??張???^ 100011101010011100111111100101010101000111100010101111110011111100111111100100101010001100111111100011111100111100111111100011101010011100111111100101010101000111100010101111110011111100111111100100101010001100111111001111110011111101011110 8ea73f9551e2bf3f3f92a33f8fcf3f8ea73f9551e2bf3f3f92a33f3f3f5e
EUC-JP 而?媛篩??張?粧?而?媛篩??張?獐?^ 1011110010101001001111111100100110110010111001001100000100111111001111111100010010100101001111111011111011010001001111111011110010101001001111111100100110110010111001001100000100111111001111111100010010100101001111111000111111001011101110100011111101011110 bca93fc9b2e4c13f3fc4a53fbed13fbca93fc9b2e4c13f3fc4a53f8fcbba3f5e
UTF-8 而렲媛篩렫렲張렜粧렋而렲媛篩렫렲張렜獐렜^ 11101000100000001000110011101011101000001011001011100101101010101001101111100111101011111010100111101011101000001010101111101011101000001011001011100101101111001011010111101011101000001001110011100111101100101010011111101011101000001000101111101000100000001000110011101011101000001011001011100101101010101001101111100111101011111010100111101011101000001010101111101011101000001011001011100101101111001011010111101011101000001001110011100111100011011001000011101011101000001001110001011110 e8808ceba0b2e5aa9be7afa9eba0abeba0b2e5bcb5eba09ce7b2a7eba08be8808ceba0b2e5aa9be7afa9eba0abeba0b2e5bcb5eba09ce78d90eba09c5e
UHC 而렲媛篩렫렲張렜粧렋而렲媛篩렫렲張렜獐렜^ 1110110010111011100011101011111111101010101100001101111011101000100011101011100110001110101111111110110111100101100011101010111011101101111100101000111010100010111011001011101110001110101111111110101010110000110111101110100010001110101110011000111010111111111011011110010110001110101011101110110111101111100011101010111001011110 ecbb8ebfeab0dee88eb98ebfede58eaeedf28ea2ecbb8ebfeab0dee88eb98ebfede58eaeedef8eae5e

SJIS-Win, EUC-JP: Legacy charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
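
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) is an assumption, and the function names to_bit_strings and convert are hypothetical. Characters that a charset cannot represent are replaced with "?" (3f), which is consistent with the ISO-8859-1 row above.

```python
# Charset labels from the table mapped to Python codec names.
# This mapping is an assumption, not taken from the page itself.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary, hexadecimal) strings."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def convert(text):
    """Return {charset label: (binary, hexadecimal)} for every charset."""
    return {label: to_bit_strings(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("張").items():
        print(label, binary, hexadecimal)
```

For an ASCII character such as "^", every charset in this sketch produces the same single byte, 01011110 (5e), which matches the final byte of each row in the table.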