To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????k?????????S 0011111100111111001111110011111100111111001111110011111100111111001111110110101100111111001111110011111100111111001111110011111100111111001111110011111101010011 3f3f3f3f3f3f3f3f3f6b3f3f3f3f3f3f3f3f3f53
SJIS-WIN 塋よ?鸚?ぐ娃??k塋よ?鸚?ぐ娃??S 100110101100100010000010111001100011111111101010010111110011111110000010101011101000100010100001001111110011111101101011100110101100100010000010111001100011111111101010010111110011111110000010101011101000100010100001001111110011111101010011 9ac882e63fea5f3f82ae88a13f3f6b9ac882e63fea5f3f82ae88a13f3f53
EUC-JP 塋よ?鸚?ぐ娃??k塋よ?鸚?ぐ娃??S 110101001100101010100100111010000011111111110011110000000011111110100100101100001011000010100011001111110011111101101011110101001100101010100100111010000011111111110011110000000011111110100100101100001011000010100011001111110011111101010011 d4caa4e83ff3c03fa4b0b0a33f3f6bd4caa4e83ff3c03fa4b0b0a33f3f53
UTF-8 塋よ떨鸚싪ぐ娃쒑큹k塋よ떨鸚싪ぐ娃쒑큹S 1110010110100001100010111110001110000010100010001110101110010110101010001110100110111000100110101110110010001011101010101110001110000001100100001110010110101000100000111110110010010010100100011110110110000001101110010110101111100101101000011000101111100011100000101000100011101011100101101010100011101001101110001001101011101100100010111010101011100011100000011001000011100101101010001000001111101100100100101001000111101101100000011011100101010011 e5a18be38288eb96a8e9b89aec8baae38190e5a883ec9291ed81b96be5a18be38288eb96a8e9b89aec8baae38190e5a883ec9291ed81b953
UHC 塋よ떨鸚싪ぐ娃쒑큹k塋よ떨鸚싪ぐ娃쒑큹S 1110011110101011101010101110100010110110101100111110010110100100100110101110100010101010101100001110100011011111100111001110100010110100100010000110101111100111101010111010101011101000101101101011001111100101101001001001101011101000101010101011000011101000110111111001110011101000101101001000100001010011 e7abaae8b6b3e5a49ae8aab0e8df9ce8b4886be7abaae8b6b3e5a49ae8aab0e8df9ce8b48853

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)