To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 張ζ?堯??絶??絶??魚??節??絶??^ 100100101010001110000011110001000011111111101010100111110011111100111111100100001110001000111111001111111001000011100010001111110011111110001011100110110011111100111111100100001101111100111111001111111001000011100010001111110011111101011110 92a383c43fea9f3f3f90e23f3f90e23f3f8b9b3f3f90df3f3f90e23f3f5e
EUC-JP 張ζ?堯??絶??絶??魚??節??絶??^ 110001001010010110100110110001100011111111110100101000010011111100111111110000001110010000111111001111111100000011100100001111110011111110110101111110110011111100111111110000001110000100111111001111111100000011100100001111110011111101011110 c4a5a6c63ff4a13f3fc0e43f3fc0e43f3fb5fb3f3fc0e13f3fc0e43f3f5e
UTF-8 張ζ굢堯억슝絶묕풆絶쏈눆魚됮띃節뱄풌絶롳풖^ 111001011011110010110101110011101011011011101010101101011010001011100101101000001010111111101100100101101011010111101100100010101001110111100111101101011011011011101011101011001001010111101101100100101000011011100111101101011011011011101100100011111000100011101011100010001000011011101001101011011001101011101011100100001010111011101011100111011000001111100111101011111000000011101011101100011000010011101101100100101000110011100111101101011011011011101011101000011011001111101101100100101001011001011110 e5bcb5ceb6eab5a2e5a0afec96b5ec8a9de7b5b6ebac95ed9286e7b5b6ec8f88eb8886e9ad9aeb90aeeb9d83e7af80ebb184ed928ce7b5b6eba1b3ed92965e
UHC 張ζ굢堯억슝絶묕풆絶쏈눆魚됮띃節뱄풌絶롳풖^ 11101101111001011010010111100110100000101000100111101000111010111011111011101111101111011011100111101111101111101001000111101111101111101000111011101111101111101001101111101110100001111010010111100101111000001000100111101001100011011011111011101111101111011011100111101111101111101001000111101111101111101000111011101111101111101001100101011110 ede5a5e68289e8ebbeefbdb9efbe91efbe8eefbe9bee87a5e5e089e98dbeefbdb9efbe91efbe8eefbe995e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)