What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 役??鎰?????v役??鎰?????vB 10010110111100000011111100111111111010000100110000111111001111110011111100111111001111110111011010010110111100000011111100111111111010000100110000111111001111110011111100111111001111110111011001000010 96f03f3fe84c3f3f3f3f3f7696f03f3fe84c3f3f3f3f3f7642
EUC-JP 役??鎰?????v役??鎰?????vB 11001100111100100011111100111111111011111010110100111111001111110011111100111111001111110111011011001100111100100011111100111111111011111010110100111111001111110011111100111111001111110111011001000010 ccf23f3fefad3f3f3f3f3f76ccf23f3fefad3f3f3f3f3f7642
UTF-8 役대씛鎰녶♤琉꾩돱v役대씛鎰녶♤琉꾩돱vB 111001011011110110111001111010111000110010000000111011001001010010011011111010011000111010110000111010111000010110110110111000101001100110100100111011111010011110001100111010101011111010101001111010111000111110110001011101101110010110111101101110011110101110001100100000001110110010010100100110111110100110001110101100001110101110000101101101101110001010011001101001001110111110100111100011001110101010111110101010011110101110001111101100010111011001000010 e5bdb9eb8c80ec949be98eb0eb85b6e299a4efa78ceabea9eb8fb176e5bdb9eb8c80ec949be98eb0eb85b6e299a4efa78ceabea9eb8fb17642
UHC 役대씛鎰녶♤琉꾩돱v役대씛鎰녶♤琉꾩돱vB 111001101011010110110100111010111001110110110000111011001111000010000110111001011010001010111011111010111010010010000100111011001000100110110100011101101110011010110101101101001110101110011101101100001110110011110000100001101110010110100010101110111110101110100100100001001110110010001001101101000111011001000010 e6b5b4eb9db0ecf086e5a2bbeba484ec89b476e6b5b4eb9db0ecf086e5a2bbeba484ec89b47642

SJIS-Win, EUC-JP: Legacy charsets mainly used to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
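
The same kind of conversion can be sketched in a few lines of Python. This is only an illustration, not the code behind this page: the function names and the mapping from the charset labels above to Python codec names (`cp932` for SJIS-Win, `euc_jp` for EUC-JP, `cp949` for UHC) are assumptions, and Python's codecs may differ from this tool's in edge cases. Characters that a charset cannot represent are replaced with `?` (0x3f), which is consistent with the runs of `3f` in the table.

```python
# Assumed mapping from the labels used in the table to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_to_bits(text, codec):
    """Return (binary bit string, hexadecimal string) for text in the given codec."""
    # errors="replace" substitutes "?" (0x3f) for unencodable characters.
    raw = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in raw)
    return binary, raw.hex()


def conversion_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_to_bits(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in conversion_table("あB").items():
        print(label, binary, hexadecimal)
```

For the input `あB`, for example, the ISO-8859-1 row comes out as `3f42` because `あ` has no Latin-1 representation, while the UTF-8 row comes out as `e3818242`.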