To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN 遙??搖??箋??邀??遙??搖??箋??邀??B 111010101010000100111111001111111001110110001010001111110011111111100010101100110011111100111111111001111011000100111111001111111110101010100001001111110011111110011101100010100011111100111111111000101011001100111111001111111110011110110001001111110011111101000010 eaa13f3f9d8a3f3fe2b33f3fe7b13f3feaa13f3f9d8a3f3fe2b33f3fe7b13f3f42
EUC-JP 遙??搖??箋??邀??遙??搖??箋??邀??B 111101001010001100111111001111111101100111101010001111110011111111100100101101010011111100111111111011101011001100111111001111111111010010100011001111110011111111011001111010100011111100111111111001001011010100111111001111111110111010110011001111110011111101000010 f4a33f3fd9ea3f3fe4b53f3feeb33f3ff4a33f3fd9ea3f3fe4b53f3feeb33f3f42
UTF-8 遙삼슉搖억쉠箋잞슴邀섓쉥遙삼슉搖억쉠箋잞슴邀섓쉥B 11101001100000011001100111101100100000101011110011101100100010101000100111100110100100001001011011101100100101101011010111101100100010011010000011100111101011101000101111101100100111101001111011101100100010101011010011101001100000101000000011101100100001001001001111101100100010011010010111101001100000011001100111101100100000101011110011101100100010101000100111100110100100001001011011101100100101101011010111101100100010011010000011100111101011101000101111101100100111101001111011101100100010101011010011101001100000101000000011101100100001001001001111101100100010011010010101000010 e98199ec82bcec8a89e69096ec96b5ec89a0e7ae8bec9e9eec8ab4e98280ec8493ec89a5e98199ec82bcec8a89e69096ec96b5ec89a0e7ae8bec9e9eec8ab4e98280ec8493ec89a542
UHC 遙삼슉搖억쉠箋잞슴邀섓쉥遙삼슉搖억쉠箋잞슴邀섓쉥B 11101001101010111011101111101111101111011011010111101000111101001011111011101111101111011010101011101111101010001001111111101111101111011011111111101001101011011001100011101111101111011010101111101001101010111011101111101111101111011011010111101000111101001011111011101111101111011010101011101111101010001001111111101111101111011011111111101001101011011001100011101111101111011010101101000010 e9abbbefbdb5e8f4beefbdaaefa89fefbdbfe9ad98efbdabe9abbbefbdb5e8f4beefbdaaefa89fefbdbfe9ad98efbdab42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)