To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???狡???朧???狡???肌 0011111100111111001111111110000011000010001111110011111100111111100111100100111100111111001111110011111111100000110000100011111100111111001111111001010010100111 3f3f3fe0c23f3f3f9e4f3f3f3fe0c23f3f3f94a7
EUC-JP ???狡???朧???狡???肌 0011111100111111001111111110000011000100001111110011111100111111110110111011000000111111001111110011111111100000110001000011111100111111001111111100100010101001 3f3f3fe0c43f3f3fdbb03f3f3fe0c43f3f3fc8a9
UTF-8 罹뀜렱狡렫ㆁ렱朧렪뀜렱狡렫ㆁ렱肌 111011111010011110100110111010111000000010011100111010111010000010110001111001111000101110100001111010111010000010101011111000111000011010000001111010111010000010110001111001101001110010100111111010111010000010101010111010111000000010011100111010111010000010110001111001111000101110100001111010111010000010101011111000111000011010000001111010111010000010110001111010001000001010001100 efa7a6eb809ceba0b1e78ba1eba0abe38681eba0b1e69ca7eba0aaeb809ceba0b1e78ba1eba0abe38681eba0b1e8828c
UHC 罹뀜렱狡렫ㆁ렱朧렪뀜렱狡렫ㆁ렱肌 1110110010111010101100101111000110001110101111101100111011101010100011101011100110100100111100011000111010111110110101101110100010001110101110001011001011110001100011101011111011001110111010101000111010111001101001001111000110001110101111101101000110111111 ecbab2f18ebeceea8eb9a4f18ebed6e88eb8b2f18ebeceea8eb9a4f18ebed1bf

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)