To what bit string is a character (or a short string) encoded in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 梧??節??潁??猥??循??倭??央?┐ 1000110011100110001111110011111110010000110111110011111100111111100111111111000100111111001111111110000011001110001111110011111110001111011110100011111100111111100110000110000000111111001111111000100110011011001111111000010010100010 8ce63f3f90df3f3f9ff13f3fe0ce3f3f8f7a3f3f98603f3f899b3f84a2
EUC-JP 梧??節??潁??猥??循??倭??央?┐ 1011100011101000001111110011111111000000111000010011111100111111110111101111001100111111001111111110000011010000001111110011111110111101110110110011111100111111110011111100000100111111001111111011000111111011001111111010100010100100 b8e83f3fc0e13f3fdef33f3fe0d03f3fbddb3f3fcfc13f3fb1fb3fa8a4
UTF-8 梧잍뇣節삣뼔潁쏂쵟猥롳숱循띷뉜倭뗥걶央뉐┐ 111001101010001010100111111011001001111010001101111010111000011110100011111001111010111110000000111011001000001010100011111010111011110010010100111001101011110110000001111011001000111110000010111011001011010110011111111001111000110010100101111010111010000110110011111011001000100010110001111001011011111010101010111010111001110110110111111010111000100110011100111001011000000010101101111010111001011110100101111010101011000110110110111001011010010010101110111010111000100110010000111000101001010010010000 e6a2a7ec9e8deb87a3e7af80ec82a3ebbc94e6bd81ec8f82ecb59fe78ca5eba1b3ec88b1e5beaaeb9db7eb899ce580adeb97a5eab1b6e5a4aeeb8990e29490
UHC 梧잍뇣節삣뼔潁쏂쵟猥롳숱循띷뉜倭뗥걶央뉐┐ 111001111111110010011111111001101000011110001011111011111011110110111011111001011001011010011100111001111011100010011011111010001010110010100000111010001110010110001110111011111011110110100010111000101110000010001101111001101011010010110110111010001101111010001011111001011000000110011100111001001110011110000111111001011010011010100100 e7fc9fe6878befbdbbe5969ce7b89be8aca0e8e58eefbda2e2e08de6b4b6e8de8be5819ce4e787e5a6a4

SJIS-Win, EUC-JP: Classic charsets used mainly to encode Japanese text on Windows (SJIS-Win = CP932) and UNIX (EUC-JP).
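
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the converter's actual implementation (which is not shown on this page). The mapping from the table's charset labels to Python codec names (for example SJIS-WIN to cp932 and UHC to cp949) and the helper names encode_row and convert are assumptions made for illustration. Characters that a charset cannot represent are replaced with "?" (0x3f), which resembles the runs of 3f in the table.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CODECS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec."""
    # errors="replace" substitutes "?" (0x3f) for characters the charset lacks.
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)
    return bits, raw.hex()


def convert(text):
    """Encode text in every charset listed in CODECS (hypothetical helper)."""
    return {label: encode_row(text, codec) for label, codec in CODECS.items()}


if __name__ == "__main__":
    for label, (bits, hex_string) in convert("あ").items():
        print(label, bits, hex_string)
```

Each byte is rendered as eight binary digits, so the binary column is always four times as long as the hexadecimal column.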