To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????n}????????n{^ 001111110011111100111111001111110011111100111111001111110011111101101110011111010011111100111111001111110011111100111111001111110011111100111111011011100111101101011110 3f3f3f3f3f3f3f3f6e7d3f3f3f3f3f3f3f3f6e7b5e
SJIS-WIN 蜊ウ雜ウ蜊ウ謐頴n}蜊ウ雜ウ蜊ウ謐頴n{^ 11100101100011011011001111101000101101101011001111100101100011011011001111100110100011011000100101101111011011100111110111100101100011011011001111101000101101101011001111100101100011011011001111100110100011011000100101101111011011100111101101011110 e58db3e8b6b3e58db3e68d896f6e7de58db3e8b6b3e58db3e68d896f6e7b5e
EUC-JP 蜊ウ雜ウ蜊ウ謐頴n}蜊ウ雜ウ蜊ウ謐頴n{^ 11101001111011011000111010110011111100001011100010001110101100111110100111101101100011101011001111101011111011011011000111010000011011100111110111101001111011011000111010110011111100001011100010001110101100111110100111101101100011101011001111101011111011011011000111010000011011100111101101011110 e9ed8eb3f0b88eb3e9ed8eb3ebedb1d06e7de9ed8eb3f0b88eb3e9ed8eb3ebedb1d06e7b5e
UTF-8 蜊ウ雜ウ蜊ウ謐頴n}蜊ウ雜ウ蜊ウ謐頴n{^ 1110100010011100100010101110111110111101101100111110100110011011100111001110111110111101101100111110100010011100100010101110111110111101101100111110100010101100100100001110100110100000101101000110111001111101111010001001110010001010111011111011110110110011111010011001101110011100111011111011110110110011111010001001110010001010111011111011110110110011111010001010110010010000111010011010000010110100011011100111101101011110 e89c8aefbdb3e99b9cefbdb3e89c8aefbdb3e8ac90e9a0b46e7de89c8aefbdb3e99b9cefbdb3e89c8aefbdb3e8ac90e9a0b46e7b5e
UHC ??雜???謐?n}??雜???謐?n{^ 00111111001111111110110111011010001111110011111100111111110110101100110100111111011011100111110100111111001111111110110111011010001111110011111100111111110110101100110100111111011011100111101101011110 3f3fedda3f3f3fdacd3f6e7d3f3fedda3f3f3fdacd3f6e7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)