What bit string is a character encoded to in each character set?

Enter one character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 äޚ밸졅r[äޚ밸졅r[^ 1110010011011110100110101110101110110000101110001110110010100001100001010111001001011011111001001101111010011010111010111011000010111000111011001010000110000101011100100101101101011110 e4de9aebb0b8eca185725be4de9aebb0b8eca185725b5e
SJIS-WIN ????°????r[????°????r[^ 00111111001111110011111100111111100000011000101100111111001111110011111100111111011100100101101100111111001111110011111100111111100000011000101100111111001111110011111100111111011100100101101101011110 3f3f3f3f818b3f3f3f3f725b3f3f3f3f818b3f3f3f3f725b5e
EUC-JP äÞ?ë°¸ì¡?r[äÞ?ë°¸ì¡?r[^ 10001111101010111010001110001111101010011011000000111111100011111010101110110011101000011110101110001111101000101011000110001111101010111100000010001111101000101100001000111111011100100101101110001111101010111010001110001111101010011011000000111111100011111010101110110011101000011110101110001111101000101011000110001111101010111100000010001111101000101100001000111111011100100101101101011110 8faba38fa9b03f8fabb3a1eb8fa2b18fabc08fa2c23f725b8faba38fa9b03f8fabb3a1eb8fa2b18fabc08fa2c23f725b5e
UTF-8 äޚ밸졅r[äޚ밸졅r[^ 1100001110100100110000111001111011000010100110101100001110101011110000101011000011000010101110001100001110101100110000101010000111000010100001010111001001011011110000111010010011000011100111101100001010011010110000111010101111000010101100001100001010111000110000111010110011000010101000011100001010000101011100100101101101011110 c3a4c39ec29ac3abc2b0c2b8c3acc2a1c285725bc3a4c39ec29ac3abc2b0c2b8c3acc2a1c285725b5e
UHC ?Þ??°¸?¡?r[?Þ??°¸?¡?r[^ 00111111101010001010110100111111001111111010000111000110101000101010110000111111101000101010111000111111011100100101101100111111101010001010110100111111001111111010000111000110101000101010110000111111101000101010111000111111011100100101101101011110 3fa8ad3f3fa1c6a2ac3fa2ae3f725b3fa8ad3f3fa1c6a2ac3fa2ae3f725b5e

SJIS-Win, EUC-JP: Classic character sets used mainly for Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
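
The conversion shown in the table can be approximated with a short Python sketch. It is an illustration only, not the code behind this page: the codec names (cp932 for SJIS-WIN, cp949 for UHC, and so on) and the helper name bit_strings are assumptions, and Python's codecs may map some characters differently from the tool that produced the table. Characters that a charset cannot represent are replaced with "?" (0x3f).

```python
# Charset labels from the table mapped to Python codec names.
# Assumption: these codecs are close equivalents, not guaranteed identical.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def bit_strings(text, codec):
    """Return (binary, hexadecimal) strings for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = bit_strings("ä", codec)
        print(label, binary, hexadecimal)
```

For example, bit_strings("ä", "utf-8") gives the two bytes c3 a4, while the same character in ISO-8859-1 is the single byte e4.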