To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????? 0011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f
SJIS-WIN ?割荊???匿ぇ 001111111000101010000100100011000111010000111111001111110011111110010011101111011000001010100101 3f8a848c743f3f3f93bd82a5
EUC-JP ?割荊???匿ぇ 001111111011001111100100101101111101010100111111001111110011111111000110101111111010010010100111 3fb3e4b7d53f3f3fc6bfa4a7
UTF-8 뤋割荊컣폀샘匿ぇ 111010111010010010001011111001011000100110110010111010001000110110001010111011001011101110100011111011011000111110000000111011001000001110011000111001011000110010111111111000111000000110000111 eba48be589b2e88d8aecbba3ed8f80ec8398e58cbfe38187
UHC 뤋割荊컣폀샘匿ぇ 10001111101110111111100111011100111110111010101010110000100011101011110010001111101110111111100111010010111110111010101010100111 8fbbf9dcfbaab08ebc8fbbf9d2fbaaa7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)