To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ???????????v???????????vB 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011000111111001111110011111100111111001111110011111100111111001111110011111100111111001111110111011001000010 3f3f3f3f3f3f3f3f3f3f3f763f3f3f3f3f3f3f3f3f3f3f7642
SJIS-WIN 將?虞?鷹?????魄v將?虞?鷹?????魄vB 100110111001001000111111100010111111000100111111100100011110100100111111001111110011111100111111001111111110100110101110011101101001101110010010001111111000101111110001001111111001000111101001001111110011111100111111001111110011111111101001101011100111011001000010 9b923f8bf13f91e93f3f3f3f3fe9ae769b923f8bf13f91e93f3f3f3f3fe9ae7642
EUC-JP 將?虞?鷹??勖??魄v將?虞?鷹??勖??魄vB 11010101111100100011111110110110111100110011111111000010111010110011111100111111100011111011001111101101001111110011111111110010101100000111011011010101111100100011111110110110111100110011111111000010111010110011111100111111100011111011001111101101001111110011111111110010101100000111011001000010 d5f23fb6f33fc2eb3f3f8fb3ed3f3ff2b076d5f23fb6f33fc2eb3f3f8fb3ed3f3ff2b07642
UTF-8 將렚虞렧鷹꿴ㄿ勖쾅렠魄v將렚虞렧鷹꿴ㄿ勖쾅렠魄vB 111001011011000010000111111010111010000010011010111010001001100110011110111010111010000010100111111010011011011110111001111010101011111110110100111000111000010010111111111001011000101110010110111011001011111010000101111010111010000010100000111010011010110110000100011101101110010110110000100001111110101110100000100110101110100010011001100111101110101110100000101001111110100110110111101110011110101010111111101101001110001110000100101111111110010110001011100101101110110010111110100001011110101110100000101000001110100110101101100001000111011001000010 e5b087eba09ae8999eeba0a7e9b7b9eabfb4e384bfe58b96ecbe85eba0a0e9ad8476e5b087eba09ae8999eeba0a7e9b7b9eabfb4e384bfe58b96ecbe85eba0a0e9ad847642
UHC 將렚虞렧鷹꿴ㄿ勖쾅렠魄v將렚虞렧鷹꿴ㄿ勖쾅렠魄vB 1110110111100010100011101010110111101001111001011000111010110110111010111110110110110010111010011010010010101111111010011110110111000100111001111000111010110001110110111101111001110110111011011110001010001110101011011110100111100101100011101011011011101011111011011011001011101001101001001010111111101001111011011100010011100111100011101011000111011011110111100111011001000010 ede28eade9e58eb6ebedb2e9a4afe9edc4e78eb1dbde76ede28eade9e58eb6ebedb2e9a4afe9edc4e78eb1dbde7642

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)