To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????R?????^[?????R?????^[^ 001111110011111100111111001111110011111101010010001111110011111100111111001111110011111101011110010110110011111100111111001111110011111100111111010100100011111100111111001111110011111100111111010111100101101101011110 3f3f3f3f3f523f3f3f3f3f5e5b3f3f3f3f3f523f3f3f3f3f5e5b5e
SJIS-WIN 午??洵?R午??洵?^[午??洵?R午??洵?^[^ 1000110011011111001111110011111110011111101010110011111101010010100011001101111100111111001111111001111110101011001111110101111001011011100011001101111100111111001111111001111110101011001111110101001010001100110111110011111100111111100111111010101100111111010111100101101101011110 8cdf3f3f9fab3f528cdf3f3f9fab3f5e5b8cdf3f3f9fab3f528cdf3f3f9fab3f5e5b5e
EUC-JP 午??洵?R午??洵?^[午??洵?R午??洵?^[^ 1011100011100001001111110011111111011110101011010011111101010010101110001110000100111111001111111101111010101101001111110101111001011011101110001110000100111111001111111101111010101101001111110101001010111000111000010011111100111111110111101010110100111111010111100101101101011110 b8e13f3fdead3f52b8e13f3fdead3f5e5bb8e13f3fdead3f52b8e13f3fdead3f5e5b5e
UTF-8 午댁떓洵쫚R午댁떓洵쫚^[午댁떓洵쫚R午댁떓洵쫚^[^ 11100101100011011000100011101011100011001000000111101011100101101001001111100110101101001011010111101100101010111001101001010010111001011000110110001000111010111000110010000001111010111001011010010011111001101011010010110101111011001010101110011010010111100101101111100101100011011000100011101011100011001000000111101011100101101001001111100110101101001011010111101100101010111001101001010010111001011000110110001000111010111000110010000001111010111001011010010011111001101011010010110101111011001010101110011010010111100101101101011110 e58d88eb8c81eb9693e6b4b5ecab9a52e58d88eb8c81eb9693e6b4b5ecab9a5e5be58d88eb8c81eb9693e6b4b5ecab9a52e58d88eb8c81eb9693e6b4b5ecab9a5e5b5e
UHC 午댁떓洵쫚R午댁떓洵쫚^[午댁떓洵쫚R午댁떓洵쫚^[^ 1110011111101101101101001110110010001011101010011110001011100111101001100110111001010010111001111110110110110100111011001000101110101001111000101110011110100110011011100101111001011011111001111110110110110100111011001000101110101001111000101110011110100110011011100101001011100111111011011011010011101100100010111010100111100010111001111010011001101110010111100101101101011110 e7edb4ec8ba9e2e7a66e52e7edb4ec8ba9e2e7a66e5e5be7edb4ec8ba9e2e7a66e52e7edb4ec8ba9e2e7a66e5e5b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)