To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 莨鷺蕩崧 111010001000111010101000111010011011011110111010111010001001010110101001111001011011010010100111 e88ea8e9b7bae895a9e5b4a7
SJIS-WIN ??¨???????´§ 001111110011111110000001010011100011111100111111001111110011111100111111001111110011111110000001010011001000000110011000 3f3f814e3f3f3f3f3f3f3f814c8198
EUC-JP ���崧 100011111010101110110010001111111010000110101111100011111010101110110001001111111000111110100010111010111000111110101011101100100011111110001111101000101110110110001111101010111010100110100001101011011010000111111000 8fabb23fa1af8fabb13f8fa2eb8fabb23f8fa2ed8faba9a1ada1f8
UTF-8 莨鷺蕩崧 110000111010100011000010100011101100001010101000110000111010100111000010101101111100001010111010110000111010100011000010100101011100001010101001110000111010010111000010101101001100001010100111 c3a8c28ec2a8c3a9c2b7c2bac3a8c295c2a9c3a5c2b4c2a7
UHC ??¨?·º????´§ 0011111100111111101000011010011100111111101000011010010010101000101011000011111100111111001111110011111110100010101001011010000111010111 3f3fa1a73fa1a4a8ac3f3f3f3fa2a5a1d7

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)