To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ????→?邑る? 001111110011111100111111001111111000000110101000001111111001011101010111100000101110100100111111 3f3f3f3f81a83f975782e93f
EUC-JP ???沅→?邑る? 0011111100111111001111111000111111000110111010011010001010101010001111111100110110111000101001001110101100111111 3f3f3f8fc6e9a2aa3fcdb8a4eb3f
UTF-8 蓮잙슣沅→쾮邑る뀆 111011111010011010011001111011001001111010011001111011001000101010100011111001101011001010000101111000101000011010010010111011001011111010101110111010011000001010010001111000111000001010001011111010111000000010000110 efa699ec9e99ec8aa3e6b285e28692ecbeaee98291e3828beb8086
UHC 蓮잙슣沅→쾮邑る뀆 111001101110010110011111111010111001101010101111111010101011011010100001111001101011001010000101111010111110100110101010111010111000010110000010 e6e59feb9aafeab6a1e6b285ebe9aaeb8582

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)