To which bit string is a character (or string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ???????? | 0011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f
SJIS-WIN | 丹俗族辿他蔵狸単 | 10010010010011111001000110101101100100011011000010010010010010001001000110111100100100011010000010010010010010111001001001010000 | 924f91ad91b0924891bc91a0924b9250
EUC-JP | 丹俗族辿他蔵狸単 | 11000011101100001100001010101111110000101011001011000011101010011100001010111110110000101010001011000011101011001100001110110001 | c3b0c2afc2b2c3a9c2bec2a2c3acc3b1
UTF-8 | 丹俗族辿他蔵狸単 | 111001001011100010111001111001001011111110010111111001101001011110001111111010001011111010111111111001001011101110010110111010001001010010110101111001111000101110111000111001011000110110011000 | e4b8b9e4bf97e6978fe8bebfe4bb96e894b5e78bb8e58d98
UHC | 丹俗族?他?狸? | 11010011101000011110000111010100111100001110100100111111111101101110001000111111110101111110000100111111 | d3a1e1d4f0e93ff6e23fd7e13f

SJIS-WIN, EUC-JP: Legacy character sets mainly used to encode Japanese text, on Windows (SJIS-WIN = CP932) and on UNIX (EUC-JP).
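
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function name `to_bit_strings` and the mapping from the page's charset labels to Python codec names (for example SJIS-WIN to `cp932`, UHC to `cp949`) are assumptions made for illustration. Characters that a charset cannot represent are replaced with `?` (0x3f), which matches the `?` entries in the table.

```python
# Assumed mapping from the charset labels used on this page to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with the given codec; return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the codec cannot encode.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


if __name__ == "__main__":
    for label, codec in CODECS.items():
        binary, hexadecimal = to_bit_strings("丹", codec)
        print(label, binary, hexadecimal)
```

Results for a given label may differ slightly from the table if the page's converter uses a different variant of the charset than the Python codec assumed here.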