Which bit string does a character (or short string) encode to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 壤??堯??絶??絶??鸚??節??絶??^ 1001101011011111001111110011111111101010100111110011111100111111100100001110001000111111001111111001000011100010001111110011111111101010010111110011111100111111100100001101111100111111001111111001000011100010001111110011111101011110 9adf3f3fea9f3f3f90e23f3f90e23f3fea5f3f3f90df3f3f90e23f3f5e
EUC-JP 壤??堯??絶??絶??鸚??節??絶??^ 1101010011100001001111110011111111110100101000010011111100111111110000001110010000111111001111111100000011100100001111110011111111110011110000000011111100111111110000001110000100111111001111111100000011100100001111110011111101011110 d4e13f3ff4a13f3fc0e43f3fc0e43f3ff3c03f3fc0e13f3fc0e43f3f5e
UTF-8 壤㎩ㅁ堯억슝絶묕풆絶쎾쉑鸚깁띃節뱄풌絶롳풖^ 11100101101000111010010011100011100011101010100111100011100001011000000111100101101000001010111111101100100101101011010111101100100010101001110111100111101101011011011011101011101011001001010111101101100100101000011011100111101101011011011011101100100011101011111011101100100010011001000111101001101110001001101011101010101110011000000111101011100111011000001111100111101011111000000011101011101100011000010011101101100100101000110011100111101101011011011011101011101000011011001111101101100100101001011001011110 e5a3a4e38ea9e38581e5a0afec96b5ec8a9de7b5b6ebac95ed9286e7b5b6ec8ebeec8991e9b89aeab981eb9d83e7af80ebb184ed928ce7b5b6eba1b3ed92965e
UHC 壤㎩ㅁ堯억슝絶묕풆絶쎾쉑鸚깁띃節뱄풌絶롳풖^ 11100101101111011010011111100101101001001011000111101000111010111011111011101111101111011011100111101111101111101001000111101111101111101000111011101111101111101001101111100101101111011010011111100101101001001011000111101001100011011011111011101111101111011011100111101111101111101001000111101111101111101000111011101111101111101001100101011110 e5bda7e5a4b1e8ebbeefbdb9efbe91efbe8eefbe9be5bda7e5a4b1e98dbeefbdb9efbe91efbe8eefbe995e

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
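
The same kind of conversion can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the `CHARSETS` mapping and the `to_bit_strings` helper are hypothetical names, and the choice of Python codecs (`cp932` for SJIS-WIN, `cp949` for UHC) is an assumption about which encodings the labels refer to. Characters that a charset cannot represent are replaced with `?` (byte 3f), which is assumed to be how the runs of 3f in the table arise.

```python
# Hypothetical mapping from the labels used on this page to Python codec names.
# The pairing (e.g. SJIS-WIN -> cp932, UHC -> cp949) is an assumption.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Return (binary, hexadecimal) bit strings for text encoded with codec."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    # Render every byte as eight binary digits, most significant bit first.
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CHARSETS.items():
        binary, hexadecimal = to_bit_strings("^", codec)
        print(label, binary, hexadecimal)
```

For an ASCII character such as `^`, every charset in the mapping yields the single byte 5e, which matches the last byte of each row in the table. Multi-byte characters are where the charsets diverge, so trying a Japanese or Korean character shows the difference most clearly.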