To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????v?????????vB 001111110011111100111111001111110011111100111111001111110011111100111111011101100011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f7642
SJIS-WIN 億?????矣??v億?????矣??vB 10001001101011010011111100111111001111110011111100111111111000011110000100111111001111110111011010001001101011010011111100111111001111110011111100111111111000011110000100111111001111110111011001000010 89ad3f3f3f3f3fe1e13f3f7689ad3f3f3f3f3fe1e13f3f7642
EUC-JP 億?????矣??v億?????矣??vB 10110010101011110011111100111111001111110011111100111111111000101110001100111111001111110111011010110010101011110011111100111111001111110011111100111111111000101110001100111111001111110111011001000010 b2af3f3f3f3f3fe2e33f3f76b2af3f3f3f3f3fe2e33f3f7642
UTF-8 億륁럥栒끿㎉矣ㅻ탵v億륁럥栒끿㎉矣ㅻ탵vB 111001011000010010000100111010111010010110000001111010111001111110100101111001101010000010010010111010111000000110111111111000111000111010001001111001111001111110100011111000111000010110111011111011011000001110110101011101101110010110000100100001001110101110100101100000011110101110011111101001011110011010100000100100101110101110000001101111111110001110001110100010011110011110011111101000111110001110000101101110111110110110000011101101010111011001000010 e58484eba581eb9fa5e6a092eb81bfe38e89e79fa3e385bbed83b576e58484eba581eb9fa5e6a092eb81bfe38e89e79fa3e385bbed83b57642
UHC 億륁럥栒끿㎉矣ㅻ탵v億륁럥栒끿㎉矣ㅻ탵vB 111001011110001010001111111011001000111010001000111000101110001110000101111001111010011110111011111010111111100010100100111010111011010110010010011101101110010111100010100011111110110010001110100010001110001011100011100001011110011110100111101110111110101111111000101001001110101110110101100100100111011001000010 e5e28fec8e88e2e385e7a7bbebf8a4ebb59276e5e28fec8e88e2e385e7a7bbebf8a4ebb5927642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)