To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 午??踰??臾??阿????“純??癲??源? 10001100110111110011111100111111111001101111101000111111001111111110010001101011001111110011111110001000101000100011111100111111001111110011111110000001011001111000111110000011001111110011111111100001100111110011111100111111100011001011100100111111 8cdf3f3fe6fa3f3fe46b3f3f88a23f3f3f3f81678f833f3fe19f3f3f8cb93f
EUC-JP 午??踰??臾??阿??沅?“純??癲??源? 101110001110000100111111001111111110110011111100001111110011111111100111110011000011111100111111101100001010010000111111001111111000111111000110111010010011111110100001110010001011110111100011001111110011111111100010101000010011111100111111101110001011101100111111 b8e13f3fecfc3f3fe7cc3f3fb0a43f3f8fc6e93fa1c8bde33f3fe2a13f3fb8bb3f
UTF-8 午닿퓥踰곻쭛臾딄퉿阿녹빆沅잒“純볧뜗癲쀫ㅏ源췇 111001011000110110001000111010111000101110111111111011011001001110100101111010001011100010110000111010101011001110111011111011001010110110011011111010001000011110111110111010111001010010000100111011011000100110111111111010011001100010111111111010111000010110111001111010111011100110000110111001101011001010000101111011001001111010010010111000101000000010011100111001111011010010010100111010111011001110100111111010111001110010010111111001111001100110110010111011001000000010101011111000111000010110001111111001101011101010010000111011001011011110000111 e58d88eb8bbfed93a5e8b8b0eab3bbecad9be887beeb9484ed89bfe998bfeb85b9ebb986e6b285ec9e92e2809ce7b494ebb3a7eb9c97e799b2ec80abe3858fe6ba90ecb787
UHC 午닿퓥踰곻쭛臾딄퉿阿녹빆沅잒“純볧뜗癲쀫ㅏ源췇 11100111111011011011010011101010101111111000111011101011101100101000000111101111101001111001000111101011101011001000101011101010101110011001011111100100101110011011001111101100100101011010110111101010101101101001111111101000101000011011000011100010111011011001001111101101100011011001101011101111101001101001011111101011101001001011111111101010101110011010111001000010 e7edb4eabf8eebb281efa791ebac8aeab997e4b9b3ec95adeab69fe8a1b0e2ed93ed8d9aefa697eba4bfeab9ae42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)