To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 Þ×ÞÒWÞ×ÞÒJn}Þ×ÞÒWÞ×ÞÒJn{^ 11011110110101111101111011010010010101111101111011010111110111101101001001001010011011100111110111011110110101111101111011010010010101111101111011010111110111101101001001001010011011100111101101011110 ded7ded257ded7ded24a6e7dded7ded257ded7ded24a6e7b5e
SJIS-WIN ?×??W?×??Jn}?×??W?×??Jn{^ 0011111110000001011111100011111100111111010101110011111110000001011111100011111100111111010010100110111001111101001111111000000101111110001111110011111101010111001111111000000101111110001111110011111101001010011011100111101101011110 3f817e3f3f573f817e3f3f4a6e7d3f817e3f3f573f817e3f3f4a6e7b5e
EUC-JP Þ×ÞÒWÞ×ÞÒJn}Þ×ÞÒWÞ×ÞÒJn{^ 1000111110101001101100001010000111011111100011111010100110110000100011111010101011010010010101111000111110101001101100001010000111011111100011111010100110110000100011111010101011010010010010100110111001111101100011111010100110110000101000011101111110001111101010011011000010001111101010101101001001010111100011111010100110110000101000011101111110001111101010011011000010001111101010101101001001001010011011100111101101011110 8fa9b0a1df8fa9b08faad2578fa9b0a1df8fa9b08faad24a6e7d8fa9b0a1df8fa9b08faad2578fa9b0a1df8fa9b08faad24a6e7b5e
UTF-8 Þ×ÞÒWÞ×ÞÒJn}Þ×ÞÒWÞ×ÞÒJn{^ 1100001110011110110000111001011111000011100111101100001110010010010101111100001110011110110000111001011111000011100111101100001110010010010010100110111001111101110000111001111011000011100101111100001110011110110000111001001001010111110000111001111011000011100101111100001110011110110000111001001001001010011011100111101101011110 c39ec397c39ec39257c39ec397c39ec3924a6e7dc39ec397c39ec39257c39ec397c39ec3924a6e7b5e
UHC Þ×Þ?WÞ×Þ?Jn}Þ×Þ?WÞ×Þ?Jn{^ 10101000101011011010000110111111101010001010110100111111010101111010100010101101101000011011111110101000101011010011111101001010011011100111110110101000101011011010000110111111101010001010110100111111010101111010100010101101101000011011111110101000101011010011111101001010011011100111101101011110 a8ada1bfa8ad3f57a8ada1bfa8ad3f4a6e7da8ada1bfa8ad3f57a8ada1bfa8ad3f4a6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)