Which bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset | Character | Bit string (binary) | Bit string (hexadecimal)
ISO-8859-1 | ????????? | 001111110011111100111111001111110011111100111111001111110011111100111111 | 3f3f3f3f3f3f3f3f3f
SJIS-WIN | 筌?????臾?? | 1110001010100011001111110011111100111111001111110011111111100100011010110011111100111111 | e2a33f3f3f3f3fe46b3f3f
EUC-JP | 筌?????臾?? | 1110010010100101001111110011111100111111001111110011111111100111110011000011111100111111 | e4a53f3f3f3f3fe7cc3f3f
UTF-8 | 筌듦퇊溜섇츐臾먯탩 | 111001111010110110001100111010111001001110100110111011011000011110001010111011111010011110001011111011001000010010000111111011001011100010010000111010001000011110111110111010111010100010101111111011011000001110101001 | e7ad8ceb93a6ed878aefa78bec8487ecb890e887beeba8afed83a9
UHC | 筌듦퇊溜섇츐臾먯탩 | 111011111010011110110101111010101011011110011011111010101111111010011000111001011010111010001011111010111010110010010000111011001011010110001011 | efa7b5eab79beafe98e5ae8bebac90ecb58b
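A conversion like the one in the table can be sketched in Python. This is only an illustration, not the code behind this page: the function name `encode_rows` and the `CHARSETS` mapping are hypothetical, and the Python codec names (`cp932` for SJIS-WIN, `cp949` for UHC) are assumed to be close equivalents of the charsets listed above. Characters that a charset cannot represent are replaced with `?` (0x3f), which would explain the runs of `3f` in the table.

```python
# Hypothetical mapping from the labels used in the table to Python codec names.
# Assumption: cp932 ~ SJIS-WIN and cp949 ~ UHC; edge-case mappings may differ.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_rows(text):
    """Return (charset, binary string, hex string) for each charset."""
    rows = []
    for label, codec in CHARSETS.items():
        # errors="replace" substitutes "?" for characters the charset lacks.
        raw = text.encode(codec, errors="replace")
        bits = "".join(f"{byte:08b}" for byte in raw)
        rows.append((label, bits, raw.hex()))
    return rows


if __name__ == "__main__":
    for label, bits, hex_string in encode_rows("あ"):
        print(label, bits, hex_string)
```

Each byte is printed as eight binary digits and two hexadecimal digits, so the binary column is always four times as long as the hexadecimal column.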

SJIS-Win, EUC-JP: Classic character sets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).