To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ??????????????????????^ 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101011110 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f5e
SJIS-WIN 哀??????唯??鷹?????悠??純??^ 10001000101000110011111100111111001111110011111100111111001111111001011101000010001111110011111110010001111010010011111100111111001111110011111100111111100101110100100100111111001111111000111110000011001111110011111101011110 88a33f3f3f3f3f3f97423f3f91e93f3f3f3f3f97493f3f8f833f3f5e
EUC-JP 哀??????唯??鷹?????悠??純??^ 10110000101001010011111100111111001111110011111100111111001111111100110110100011001111110011111111000010111010110011111100111111001111110011111100111111110011011010101000111111001111111011110111100011001111110011111101011110 b0a53f3f3f3f3f3fcda33f3fc2eb3f3f3f3f3fcdaa3f3fbde33f3f5e
UTF-8 哀넘띿첈麗몃같唯됭쳸鷹귣쳥閱묐ㅉ悠뚪뒽純볦뺑^ 11100101100100111000000011101011100001001001100011101011100111011011111111101100101100101000100011101111101001101000100011101011101010101000001111101010101100001001100111100101100101001010111111101011100100001010110111101100101100111011100011101001101101111011100111101010101101111010001111101100101100111010010111101001100101101011000111101011101011001001000011100011100001011000100111100110100000101010000011101011100110101010101011101011100100101011110111100111101101001001010011101011101100111010011011101011101110101001000101011110 e59380eb8498eb9dbfecb288efa688ebaa83eab099e594afeb90adecb3b8e9b7b9eab7a3ecb3a5e996b1ebac90e38589e682a0eb9aaaeb92bde7b494ebb3a6ebba915e
UHC 哀넘띿첈麗몃같唯됭쳸鷹귣쳥閱묐ㅉ悠뚪뒽純볦뺑^ 111001001110111010110011110100011000110111101100101010101001010111100110101100001011100011101011101100001011000011101010111001101000100111101000101010111001101111101011111011011000001011101011101010111000101011100110111100111001000111101011101001001011100111101010111011011000110011101001100010101011001111100010111011011001001111101100101110111011000101011110 e4eeb3d18decaa95e6b0b8ebb0b0eae689e8ab9bebed82ebab8ae6f391eba4b9eaed8ce98ab3e2ed93ecbbb15e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)