What bit string is a character (or a short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character(s) Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ?ョ?厄θ?淹? 001111111000001110000111001111111001011011101111100000111100011000111111100111111011100100111111 3f83873f96ef83c63f9fb93f
EUC-JP ?ョ?厄θ?淹? 001111111010010111100111001111111100110011110001101001101100100000111111110111101011101100111111 3fa5e73fccf1a6c83fdebb3f
UTF-8 吳ョ뇱厄θ솦淹툫 1110010110010000101100111110001110000011101001111110101110000111101100011110010110001110100001001100111010111000111011001000011010100110111001101011011110111001111011011000100010101011 e590b3e383a7eb87b1e58e84ceb8ec86a6e6b7b9ed88ab
UHC 吳ョ뇱厄θ솦淹툫 11100111111011111010101111100111100001111001010111100100111110001010010111101000100110011001111111100101111101001011100101000010 e7efabe78795e4f8a5e8999fe5f4b942

SJIS-Win, EUC-JP: Legacy character sets used mainly for encoding Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
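
For readers who want to reproduce a table like the one above offline, here is a minimal Python sketch. It is not the converter used by this page: the helper name `encode_table` and the mapping from the page's charset labels to Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC, and so on) are assumptions, and Python's codecs may map a few characters differently from the page's converter. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `3f` bytes seen in the table.

```python
# Assumed mapping from the page's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_table(text):
    """Return one (label, shown, bits, hex) row per charset for `text`."""
    rows = []
    for label, codec in CHARSETS.items():
        # Unencodable characters become "?" instead of raising an error.
        data = text.encode(codec, errors="replace")
        # Decode again to show what survives the round trip.
        shown = data.decode(codec, errors="replace")
        # Eight bits per byte, most significant bit first.
        bits = "".join(f"{byte:08b}" for byte in data)
        rows.append((label, shown, bits, data.hex()))
    return rows


if __name__ == "__main__":
    for row in encode_table("θ"):
        print(" ".join(row))
```

Calling `encode_table("θ")` gives the hexadecimal values `ceb8` for UTF-8, `83c6` for SJIS-WIN, `a6c8` for EUC-JP and `a5e8` for UHC, the same byte pairs that appear for θ in the table above, while ISO-8859-1 falls back to `3f`.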