To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????}i?????????}iB 0011111100111111001111110011111100111111001111110011111100111111001111110111110101101001001111110011111100111111001111110011111100111111001111110011111100111111011111010110100101000010 3f3f3f3f3f3f3f3f3f7d693f3f3f3f3f3f3f3f3f7d6942
SJIS-WIN 嚥≪?二??怨??}i嚥≪?二??怨??}iB 10011010100010111000000111100001001111111001001111110001001111110011111110001001100001010011111100111111011111010110100110011010100010111000000111100001001111111001001111110001001111110011111110001001100001010011111100111111011111010110100101000010 9a8b81e13f93f13f3f89853f3f7d699a8b81e13f93f13f3f89853f3f7d6942
EUC-JP 嚥≪?二??怨??}i嚥≪?二??怨??}iB 11010011111010111010001011100011001111111100011011110011001111110011111110110001111001010011111100111111011111010110100111010011111010111010001011100011001111111100011011110011001111110011111110110001111001010011111100111111011111010110100101000010 d3eba2e33fc6f33f3fb1e53f3f7d69d3eba2e33fc6f33f3fb1e53f3f7d6942
UTF-8 嚥≪떜二븝쭓怨뺤죸}i嚥≪떜二븝쭓怨뺤죸}iB 1110010110011010101001011110001010001001101010101110101110010110100111001110010010111010100011001110101110111000100111011110110010101101100100111110011010000000101010001110101110111010101001001110110010100011101110000111110101101001111001011001101010100101111000101000100110101010111010111001011010011100111001001011101010001100111010111011100010011101111011001010110110010011111001101000000010101000111010111011101010100100111011001010001110111000011111010110100101000010 e59aa5e289aaeb969ce4ba8cebb89decad93e680a8ebbaa4eca3b87d69e59aa5e289aaeb969ce4ba8cebb89decad93e680a8ebbaa4eca3b87d6942
UHC 嚥≪떜二븝쭓怨뺤죸}i嚥≪떜二븝쭓怨뺤죸}iB 1110011010111111101000011110110010001011101100101110110010100011101110101110111110100111100010111110101010110011100101011110110010100001100100100111110101101001111001101011111110100001111011001000101110110010111011001010001110111010111011111010011110001011111010101011001110010101111011001010000110010010011111010110100101000010 e6bfa1ec8bb2eca3baefa78beab395eca1927d69e6bfa1ec8bb2eca3baefa78beab395eca1927d6942

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)