What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????? 0011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ???伊??蹂??巍ル‘而? 0011111100111111001111111000100011001001001111110011111111100110111110000011111100111111100110111101100110000011100010111000000101100101100011101010011100111111 3f3f3f88c93f3fe6f83f3f9bd9838b81658ea73f
EUC-JP ???伊??蹂??巍ル‘而? 0011111100111111001111111011000011001011001111110011111111101100111110100011111100111111110101101101101110100101111010111010000111000110101111001010100100111111 3f3f3fb0cb3f3fecfa3f3fd6dba5eba1c6bca93f
UTF-8 嶺뚳퐣伊숋㎕蹂⒲럸巍ル‘而쵢 111011111010011010101011111010111001101010110011111011011001000010100011111001001011110010001010111011001000100010001011111000111000111010010101111010001011100110000010111000101001001010110010111010111001111110111000111001011011011110001101111000111000001110101011111000101000000010011000111010001000000010001100111011001011010110100010 efa6abeb9ab3ed90a3e4bc8aec888be38e95e8b982e292b2eb9fb8e5b78de383abe28098e8808cecb5a2
UHC 嶺뚳퐣伊숋㎕蹂⒲럸巍ル‘而쵢 11100111101011011000110011101111101111011000110011101100101001011001100111101111101001111010000111101011101100111010100111100011100011101001011111101000111001001010101111101011101000011010111011101100101110111010110101000010 e7ad8cefbd8ceca599efa7a1ebb3a9e38e97e8e4abeba1aeecbbad42

SJIS-Win, EUC-JP: Legacy character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
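
The conversion shown in the table can be approximated offline with a short Python sketch. This is not the page's own implementation. The function names `to_bit_strings` and `convert` are hypothetical, and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932` and UHC to `cp949`) is an assumption. Characters that a charset cannot represent are replaced with `?` (0x3f), which resembles the `3f` bytes seen in the ISO-8859-1 row.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes '?' (0x3f) for unrepresentable characters.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset in CHARSETS."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    # Print one row per charset, in the same column order as the table.
    for label, (binary, hexadecimal) in convert("あ").items():
        print(label, binary, hexadecimal)
```

For example, `to_bit_strings("A", "utf-8")` returns `("01000001", "41")`, and `to_bit_strings("あ", "iso-8859-1")` returns the bits and hex for a single `?`, since ISO-8859-1 has no hiragana.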