To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ØÉßÈä¨kØÉßÈä¨YØÉßÈä¨k 110110001100100111011111110010001110010010101000011010111101100011001001110111111100100011100100101010000101100111011000110010011101111111001000111001001010100001101011 d8c9dfc8e4a86bd8c9dfc8e4a859d8c9dfc8e4a86b
SJIS-WIN ?????¨k?????¨Y?????¨k 001111110011111100111111001111110011111110000001010011100110101100111111001111110011111100111111001111111000000101001110010110010011111100111111001111110011111100111111100000010100111001101011 3f3f3f3f3f814e6b3f3f3f3f3f814e593f3f3f3f3f814e6b
EUC-JP ØÉßÈä¨kØÉßÈä¨YØÉßÈä¨k 100011111010100110101100100011111010101010110001100011111010100111001110100011111010101010110010100011111010101110100011101000011010111101101011100011111010100110101100100011111010101010110001100011111010100111001110100011111010101010110010100011111010101110100011101000011010111101011001100011111010100110101100100011111010101010110001100011111010100111001110100011111010101010110010100011111010101110100011101000011010111101101011 8fa9ac8faab18fa9ce8faab28faba3a1af6b8fa9ac8faab18fa9ce8faab28faba3a1af598fa9ac8faab18fa9ce8faab28faba3a1af6b
UTF-8 ØÉßÈä¨kØÉßÈä¨YØÉßÈä¨k 110000111001100011000011100010011100001110011111110000111000100011000011101001001100001010101000011010111100001110011000110000111000100111000011100111111100001110001000110000111010010011000010101010000101100111000011100110001100001110001001110000111001111111000011100010001100001110100100110000101010100001101011 c398c389c39fc388c3a4c2a86bc398c389c39fc388c3a4c2a859c398c389c39fc388c3a4c2a86b
UHC Ø?ß??¨kØ?ß??¨YØ?ß??¨k 101010001010101000111111101010011010110000111111001111111010000110100111011010111010100010101010001111111010100110101100001111110011111110100001101001110101100110101000101010100011111110101001101011000011111100111111101000011010011101101011 a8aa3fa9ac3f3fa1a76ba8aa3fa9ac3f3fa1a759a8aa3fa9ac3f3fa1a76b

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)