To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???唯??魏??餘?????唯??魏??餘??B 00111111001111110011111110010111010000100011111100111111111010011011000000111111001111111110100101010000001111110011111100111111001111110011111110010111010000100011111100111111111010011011000000111111001111111110100101010000001111110011111101000010 3f3f3f97423f3fe9b03f3fe9503f3f3f3f3f97423f3fe9b03f3fe9503f3f42
EUC-JP ???唯??魏??餘?????唯??魏??餘??B 00111111001111110011111111001101101000110011111100111111111100101011001000111111001111111111000110110001001111110011111100111111001111110011111111001101101000110011111100111111111100101011001000111111001111111111000110110001001111110011111101000010 3f3f3fcda33f3ff2b23f3ff1b13f3f3f3f3fcda33f3ff2b23f3ff1b13f3f42
UTF-8 列룸쑜唯욇쐯魏녴돞餘됰쬃列룸쑜唯욇쐯魏녴돞餘됰쬃B 11101111101001101001110011101011101000111011100011101100100100011001110011100101100101001010111111101100100110101000011111101100100100001010111111101001101011011000111111101011100001011011010011101011100011111001111011101001101001001001100011101011100100001011000011101100101011001000001111101111101001101001110011101011101000111011100011101100100100011001110011100101100101001010111111101100100110101000011111101100100100001010111111101001101011011000111111101011100001011011010011101011100011111001111011101001101001001001100011101011100100001011000011101100101011001000001101000010 efa69ceba3b8ec919ce594afec9a87ec90afe9ad8feb85b4eb8f9ee9a498eb90b0ecac83efa69ceba3b8ec919ce594afec9a87ec90afe9ad8feb85b4eb8f9ee9a498eb90b0ecac8342
UHC 列룸쑜唯욇쐯魏녴돞餘됰쬃列룸쑜唯욇쐯魏녴돞餘됰쬃B 11100110111010101011011111101011100111001011101111101010111001101001111011101001100111001001001111101010111000001000011011100011100010011010010011100110101011101000100111101011101001101001101011100110111010101011011111101011100111001011101111101010111001101001111011101001100111001001001111101010111000001000011011100011100010011010010011100110101011101000100111101011101001101001101001000010 e6eab7eb9cbbeae69ee99c93eae086e389a4e6ae89eba69ae6eab7eb9cbbeae69ee99c93eae086e389a4e6ae89eba69a42

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)