What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN ?щ?崖リ?瓮?n}?щ?崖リ?瓮?n{^ 0011111110000100100010110011111110001010010100101000001110001010001111111110000101000100001111110110111001111101001111111000010010001011001111111000101001010010100000111000101000111111111000010100010000111111011011100111101101011110 3f848b3f8a52838a3fe1443f6e7d3f848b3f8a52838a3fe1443f6e7b5e
EUC-JP ?щ?崖リ?瓮?n}?щ?崖リ?瓮?n{^ 0011111110100111111010110011111110110011101100111010010111101010001111111110000110100101001111110110111001111101001111111010011111101011001111111011001110110011101001011110101000111111111000011010010100111111011011100111101101011110 3fa7eb3fb3b3a5ea3fe1a53f6e7d3fa7eb3fb3b3a5ea3fe1a53f6e7b5e
UTF-8 吳щ젿崖リ퀓瓮쮗n}吳щ젿崖リ퀓瓮쮗n{^ 111001011001000010110011110100011000100111101100101000001011111111100101101101001001011011100011100000111010101011101101100000001001001111100111100100111010111011101100101011101001011101101110011111011110010110010000101100111101000110001001111011001010000010111111111001011011010010010110111000111000001110101010111011011000000010010011111001111001001110101110111011001010111010010111011011100111101101011110 e590b3d189eca0bfe5b496e383aaed8093e793aeecae976e7de590b3d189eca0bfe5b496e383aaed8093e793aeecae976e7b5e
UHC 吳щ젿崖リ퀓瓮쮗n}吳щ젿崖リ퀓瓮쮗n{^ 11100111111011111010110011101011101000001011000111100100111100001010101111101010101100111000100011101000101101111010100001101111011011100111110111100111111011111010110011101011101000001011000111100100111100001010101111101010101100111000100011101000101101111010100001101111011011100111101101011110 e7efaceba0b1e4f0abeab388e8b7a86f6e7de7efaceba0b1e4f0abeab388e8b7a86f6e7b5e

SJIS-Win, EUC-JP: Legacy charsets mainly used for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP). A "?" (hex 3f) in a row marks a character that the charset cannot represent.
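
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation. The `CODECS` mapping from the table's charset labels to Python codec names and the helper `encode_row` are assumptions made for illustration. It also assumes that unencodable characters are replaced with "?" (hex 3f), as the ISO-8859-1 row suggests.

```python
# Hypothetical mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "latin-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, charset):
    """Return (binary string, hex string) for text encoded in charset."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    data = text.encode(CODECS[charset], errors="replace")
    # Each byte becomes exactly eight binary digits, zero-padded on the left.
    bits = "".join(f"{byte:08b}" for byte in data)
    return bits, data.hex()


if __name__ == "__main__":
    for name in CODECS:
        bits, hexed = encode_row("n}", name)
        print(name, bits, hexed)
```

The ASCII characters "n}" encode to 6e7d in every charset listed, which matches the bytes that close each half of the rows in the table. Results for non-ASCII characters may differ slightly from the table if the codec tables used by Python and by the converter disagree.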