To what bit string is a character (or short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????[????????[^ 00111111001111110011111100111111001111110011111100111111001111110101101100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 碩贍碩泄碩贍碩泄[碩贍碩泄碩贍碩泄[^ 1001000011010111111001101101011010010000110101111001111110010101100100001101011111100110110101101001000011010111100111111001010101011011100100001101011111100110110101101001000011010111100111111001010110010000110101111110011011010110100100001101011110011111100101010101101101011110 90d7e6d690d79f9590d7e6d690d79f955b90d7e6d690d79f9590d7e6d690d79f955b5e
EUC-JP 碩贍碩泄碩贍碩泄[碩贍碩泄碩贍碩泄[^ 1100000011011001111011001101100011000000110110011101110111110101110000001101100111101100110110001100000011011001110111011111010101011011110000001101100111101100110110001100000011011001110111011111010111000000110110011110110011011000110000001101100111011101111101010101101101011110 c0d9ecd8c0d9ddf5c0d9ecd8c0d9ddf55bc0d9ecd8c0d9ddf5c0d9ecd8c0d9ddf55b5e
UTF-8 碩贍碩泄碩贍碩泄[碩贍碩泄碩贍碩泄[^ 111001111010001010101001111010001011010010001101111001111010001010101001111001101011001110000100111001111010001010101001111010001011010010001101111001111010001010101001111001101011001110000100010110111110011110100010101010011110100010110100100011011110011110100010101010011110011010110011100001001110011110100010101010011110100010110100100011011110011110100010101010011110011010110011100001000101101101011110 e7a2a9e8b48de7a2a9e6b384e7a2a9e8b48de7a2a9e6b3845be7a2a9e8b48de7a2a9e6b384e7a2a9e8b48de7a2a9e6b3845b5e
UHC 碩贍碩泄碩贍碩泄[碩贍碩泄碩贍碩泄[^ 1110000010110101111000001110101111100000101101011110000011011100111000001011010111100000111010111110000010110101111000001101110001011011111000001011010111100000111010111110000010110101111000001101110011100000101101011110000011101011111000001011010111100000110111000101101101011110 e0b5e0ebe0b5e0dce0b5e0ebe0b5e0dc5be0b5e0ebe0b5e0dce0b5e0ebe0b5e0dc5b5e

SJIS-Win, EUC-JP: Legacy charsets used mainly for encoding Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
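
The conversion shown in the table can be sketched in Python roughly as follows. This is only an illustration, not the code behind this page: the function name `to_bit_strings` is hypothetical, the Python codec names `cp932` and `cp949` are assumed to correspond to SJIS-WIN and UHC, and `errors="replace"` is assumed because the ISO-8859-1 row shows unencodable characters as `?` (0x3f).

```python
# Hypothetical helper: encode text in a charset and return its bit strings.
def to_bit_strings(text, charset):
    # Characters the charset cannot represent become "?" (assumption, see above).
    data = text.encode(charset, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()

# Page charset label -> Python codec name (the mapping is an assumption).
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}

for label, codec in CHARSETS.items():
    binary, hexadecimal = to_bit_strings("碩[^", codec)
    print(label, binary, hexadecimal)
```

Each byte is rendered as exactly eight binary digits, so the binary column is always four times as long as the hexadecimal one.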