To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ??????伊??暎??闇?????吟??B 0011111100111111001111110011111100111111001111111000100011001001001111110011111110011101111100110011111100111111100010001100010100111111001111110011111100111111001111111000101111100001001111110011111101000010 3f3f3f3f3f3f88c93f3f9df33f3f88c53f3f3f3f3f8be13f3f42
EUC-JP ??????伊??暎??闇??彛??吟??B 00111111001111110011111100111111001111110011111110110000110010110011111100111111110110101111010100111111001111111011000011000111001111110011111110001111101111001111101000111111001111111011011011100011001111110011111101000010 3f3f3f3f3f3fb0cb3f3fdaf53f3fb0c73f3f8fbcfa3f3fb6e33f3f42
UTF-8 琉뗥깋轢우춺伊싳깄暎노쨱闇됪셀彛뽰쪡吟끿껙B 11101111101001111000110011101011100101111010010111101010101110011000101111101111101001101000110111101100100110101011000011101100101101101011101011100100101111001000101011101100100010111011001111101010101110011000010011100110100110101000111011101011100001011011100011101100101010001011000111101001100101111000011111101011100100001010101011101100100001011000000011100101101111011001101111101011101111011011000011101100101010101010000111100101100100001001111111101011100000011011111111101010101110111001100101000010 efa78ceb97a5eab98befa68dec9ab0ecb6bae4bc8aec8bb3eab984e69a8eeb85b8eca8b1e99787eb90aaec8580e5bd9bebbdb0ecaaa1e5909feb81bfeabb9942
UHC 琉뗥깋轢우춺伊싳깄暎노쨱闇됪셀彛뽰쪡吟끿껙B 11101011101001001000101111100101100000111000100111100110101111001011111111101100101011011001011011101100101001011001101011101100100000111000010111100111101100101011001111101011101001001000101111100100111000011000100111100110101111001011111111101100101011011001011011101100101001011001101011101011111000011000010111100111101100101011001101000010 eba48be58389e6bcbfecad96eca59aec8385e7b2b3eba48be4e189e6bcbfecad96eca59aebe185e7b2b342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)