To which bit string is a character (or a short string of characters) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
| Charset | Character | Bit string (binary) | Bit string (hexadecimal) |
|---|---|---|---|
| ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f |
| SJIS-WIN | ??窕?????? | 00111111001111111110001001111001001111110011111100111111001111110011111100111111 | 3f3fe2793f3f3f3f3f3f |
| EUC-JP | ??窕?????? | 00111111001111111110001111011010001111110011111100111111001111110011111100111111 | 3f3fe3da3f3f3f3f3f3f |
| UTF-8 | 센섧窕센소센솰센송 | 111011001000010010111100111011001000010010100111111001111010101010010101111011001000010010111100111011001000011010001100111011001000010010111100111011001000011010110000111011001000010010111100111011001000011010100001 | ec84bcec84a7e7aa95ec84bcec868cec84bcec86b0ec84bcec86a1 |
| UHC | 센섧窕센소센솰센송 | 101111001011111010111100101101011111000011010111101111001011111010111100110100101011110010111110101111001110000010111100101111101011110011011011 | bcbebcb5f0d7bcbebcd2bcbebce0bcbebcdb |

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP). Characters that a charset cannot represent appear as "?" (byte 3f).
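
A conversion of this kind can be sketched in Python. This is only an illustration, not the code behind this page: the function name `encode_table` is hypothetical, and the mapping of the charset labels above to Python codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) is an assumption. Unencodable characters are replaced with "?" (3f), which mirrors the rows above.

```python
# Minimal sketch: encode a string in several charsets and show the bytes
# as a binary bit string and as hexadecimal.
# Assumption: these Python codec names correspond to the charsets in the table.
CHARSETS = ("iso-8859-1", "cp932", "euc_jp", "utf-8", "cp949")


def encode_table(text, charsets=CHARSETS):
    """Return a list of (charset, round-tripped text, bit string, hex string)."""
    rows = []
    for name in charsets:
        # errors="replace" substitutes "?" for characters the charset lacks
        data = text.encode(name, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
        rows.append((name, data.decode(name), bits, data.hex()))
    return rows


if __name__ == "__main__":
    for charset, shown, bits, hexa in encode_table("센"):
        print(charset, shown, bits, hexa)
```

With the input "센", the UTF-8 row should come out as ec84bc, and the ISO-8859-1 row as 3f, since that charset has no Hangul.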