To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????? 001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f
SJIS-WIN ?b????蹂〓? 001111111000001010000010001111110011111100111111001111111110011011111000100000011010110000111111 3f82823f3f3f3fe6f881ac3f
EUC-JP 渶b????蹂〓? 1000111111000111111011011010001111100010001111110011111100111111001111111110110011111010101000101010111000111111 8fc7eda3e23f3f3f3fecfaa2ae3f
UTF-8 渶b벁麟딂삏蹂〓뇲 111001101011100010110110111011111011110110000010111010111011001010000001111011111010011110110011111010111001010010000010111011001000001010001111111010001011100110000010111000111000000010010011111010111000011110110010 e6b8b6efbd82ebb281efa7b3eb9482ec828fe8b982e38093eb87b2
UHC 渶b벁麟딂삏蹂〓뇲 111001111011011110100011111000101001001110100111111011001110100010001010111010001001100010010110111010111011001110100001111010111000011110010110 e7b7a3e293a7ece88ae89896ebb3a1eb8796

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)