To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ?割孩???匿ぇ 001111111000101010000100100110110111011100111111001111110011111110010011101111011000001010100101 3f8a849b773f3f3f93bd82a5
EUC-JP ?割孩???匿ぇ 001111111011001111100100110101011101100000111111001111110011111111000110101111111010010010100111 3fb3e4d5d83f3f3fc6bfa4a7
UTF-8 뤋割孩컣폀샘匿ぇ 111010111010010010001011111001011000100110110010111001011010110110101001111011001011101110100011111011011000111110000000111011001000001110011000111001011000110010111111111000111000000110000111 eba48be589b2e5ada9ecbba3ed8f80ec8398e58cbfe38187
UHC 뤋割孩컣폀샘匿ぇ 10001111101110111111100111011100111110101010100110110000100011101011110010001111101110111111100111010010111110111010101010100111 8fbbf9dcfaa9b08ebc8fbbf9d2fbaaa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)