To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 鿯ê×úëæËÞv鿯ê×úëæËÞvB 1110100111100110110001101110101011010111111110101110101111100110110010111101111001110110111010011110011011000110111010101101011111111010111010111110011011001011110111100111011001000010 e9e6c6ead7faebe6cbde76e9e6c6ead7faebe6cbde7642
SJIS-WIN ????×?????v????×?????vB 00111111001111110011111100111111100000010111111000111111001111110011111100111111001111110111011000111111001111110011111100111111100000010111111000111111001111110011111100111111001111110111011001000010 3f3f3f3f817e3f3f3f3f3f763f3f3f3f817e3f3f3f3f3f7642
EUC-JP 鿯ê×úëæËÞv鿯ê×úëæËÞvB 10001111101010111011000110001111101010011100000110001111101010011010000110001111101010111011010010100001110111111000111110101011111000101000111110101011101100111000111110101001110000011000111110101010101100111000111110101001101100000111011010001111101010111011000110001111101010011100000110001111101010011010000110001111101010111011010010100001110111111000111110101011111000101000111110101011101100111000111110101001110000011000111110101010101100111000111110101001101100000111011001000010 8fabb18fa9c18fa9a18fabb4a1df8fabe28fabb38fa9c18faab38fa9b0768fabb18fa9c18fa9a18fabb4a1df8fabe28fabb38fa9c18faab38fa9b07642
UTF-8 鿯ê×úëæËÞv鿯ê×úëæËÞvB 11000011101010011100001110100110110000111000011011000011101010101100001110010111110000111011101011000011101010111100001110100110110000111000101111000011100111100111011011000011101010011100001110100110110000111000011011000011101010101100001110010111110000111011101011000011101010111100001110100110110000111000101111000011100111100111011001000010 c3a9c3a6c386c3aac397c3bac3abc3a6c38bc39e76c3a9c3a6c386c3aac397c3bac3abc3a6c38bc39e7642
UHC ?æÆ?×??æ?Þv?æÆ?×??æ?ÞvB 001111111010100110100001101010001010000100111111101000011011111100111111001111111010100110100001001111111010100010101101011101100011111110101001101000011010100010100001001111111010000110111111001111110011111110101001101000010011111110101000101011010111011001000010 3fa9a1a8a13fa1bf3f3fa9a13fa8ad763fa9a1a8a13fa1bf3f3fa9a13fa8ad7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)