To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????h????????? 00111111001111110011111100111111001111110011111100111111001111110011111101101000001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f683f3f3f3f3f3f3f3f3f
SJIS-WIN 藥?ク泳?ぐ娃??h藥?ク泳?ぐ娃?? 1110010101011010001111111000001101001110100010010110101000111111100000101010111010001000101000010011111100111111011010001110010101011010001111111000001101001110100010010110101000111111100000101010111010001000101000010011111100111111 e55a3f834e896a3f82ae88a13f3f68e55a3f834e896a3f82ae88a13f3f
EUC-JP 藥?ク泳?ぐ娃??h藥?ク泳?ぐ娃?? 1110100110111011001111111010010110101111101100011100101100111111101001001011000010110000101000110011111100111111011010001110100110111011001111111010010110101111101100011100101100111111101001001011000010110000101000110011111100111111 e9bb3fa5afb1cb3fa4b0b0a33f3f68e9bb3fa5afb1cb3fa4b0b0a33f3f
UTF-8 藥썹ク泳싪ぐ娃쒏릍h藥썹ク泳싪ぐ娃쒏릍 11101000100101111010010111101100100011011011100111100011100000101010111111100110101100111011001111101100100010111010101011100011100000011001000011100101101010001000001111101100100100101000111111101011101001101000110101101000111010001001011110100101111011001000110110111001111000111000001010101111111001101011001110110011111011001000101110101010111000111000000110010000111001011010100010000011111011001001001010001111111010111010011010001101 e897a5ec8db9e382afe6b3b3ec8baae38190e5a883ec928feba68d68e897a5ec8db9e382afe6b3b3ec8baae38190e5a883ec928feba68d
UHC 藥썹ク泳싪ぐ娃쒏릍h藥썹ク泳싪ぐ娃쒏릍 11100101101101111011110111100111101010111010111111100111101101101001101011101000101010101011000011101000110111111001110011100110101110001010110001101000111001011011011110111101111001111010101110101111111001111011011010011010111010001010101010110000111010001101111110011100111001101011100010101100 e5b7bde7abafe7b69ae8aab0e8df9ce6b8ac68e5b7bde7abafe7b69ae8aab0e8df9ce6b8ac

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)