To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN 凹???ゆ?雍? 1000100110011010001111110011111100111111100000101110010000111111111010001011010000111111 899a3f3f3f82e43fe8b43f
EUC-JP 凹???ゆ?雍? 1011000111111010001111110011111100111111101001001110011000111111111100001011011000111111 b1fa3f3f3fa4e63ff0b63f
UTF-8 凹좑쪡溜ゆ짂雍쫤 111001011000011110111001111011001010001010010001111011001010101010100001111011111010011110001011111000111000001010000110111011001010011110000010111010011001101110001101111011001010101110100100 e587b9eca291ecaaa1efa78be38286eca782e99b8decaba4
UHC 凹좑쪡溜ゆ짂雍쫤 11101000111010101010000011101111101001011001101011101010111111101010101011100110101000111001001011101000101111001010011001110111 e8eaa0efa59aeafeaae6a392e8bca677

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)