To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????^ 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 歪????????餘??節ら?窈??鈺??^ 10011000011000110011111100111111001111110011111100111111001111110011111100111111111010010101000000111111001111111001000011011111100000101110011100111111111000100111011100111111001111111111101111000100001111110011111101011110 98633f3f3f3f3f3f3f3fe9503f3f90df82e73fe2773f3ffbc43f3f5e
EUC-JP 歪?????旿??餘??節ら?窈??鈺??^ 11001111110001000011111100111111001111110011111100111111100011111100000111110100001111110011111111110001101100010011111100111111110000001110000110100100111010010011111111100011110110000011111100111111100011111110001111010101001111110011111101011110 cfc43f3f3f3f3f8fc1f43f3ff1b13f3fc0e1a4e93fe3d83f3f8fe3d53f3f5e
UTF-8 歪귨쉠樂됮죺旿⑵춯餘됮굚節ら썖窈붻쑊鈺뚧뿬^ 11100110101011011010101011101010101101111010100011101100100010011010000011101111101001101011111111101011100100001010111011101100101000111011101011100110100101111011111111100010100100011011010111101100101101101010111111101001101001001001100011101011100100001010111011101010101101011001101011100111101011111000000011100011100000101000100111101100100011011001011011100111101010101000100011101011101101101011101111101100100100011000101011101001100010001011101011101011100110101010011111101011101111111010110001011110 e6adaaeab7a8ec89a0efa6bfeb90aeeca3bae697bfe291b5ecb6afe9a498eb90aeeab59ae7af80e38289ec8d96e7aa88ebb6bbec918ae988baeb9aa7ebbfac5e
UHC 歪귨쉠樂됮죺旿⑵춯餘됮굚節ら썖窈붻쑊鈺뚧뿬^ 11101000111000001000001011101111101111011010101011101000111110011000100111101001101000011001010011100111111110101010100111101000101011011000110011100110101011101000100111101001100000101000001011101111101111011010101011101001100110111000100111101001101000011001010011101000100111001010100111101000101011011000110011100110100101111010110001011110 e8e082efbdaae8f989e9a194e7faa9e8ad8ce6ae89e98282efbdaae99b89e9a194e89ca9e8ad8ce697ac5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)