To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN 烏??逸??音?? 100010010100011100111111001111111000100011101101001111110011111110001001101110010011111100111111 89473f3f88ed3f3f89b93f3f
EUC-JP 烏??逸??音?? 101100011010100000111111001111111011000011101111001111110011111110110010101110110011111100111111 b1a83f3fb0ef3f3fb2bb3f3f
UTF-8 烏띾쪋逸븃쵟音쎌댇 111001111000001110001111111010111001110110111110111011001010101010001011111010011000000010111000111010111011100010000011111011001011010110011111111010011001111110110011111011001000111010001100111010111000110010000111 e7838feb9dbeecaa8be980b8ebb883ecb59fe99fb3ec8e8ceb8c87
UHC 烏띾쪋逸븃쵟音쎌댇 111010001010000110001101111010111010010110000101111011001110111110111010111010001010110010100000111010111110010110111101111011001000100010110001 e8a18deba585ecefbae8aca0ebe5bdec88b1

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)