To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ?????????????????????B 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111101000010 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f42
SJIS-WIN ???俉??伊??暎??闇?????吟??B 001111110011111100111111111110100110000100111111001111111000100011001001001111110011111110011101111100110011111100111111100010001100010100111111001111110011111100111111001111111000101111100001001111110011111101000010 3f3f3ffa613f3f88c93f3f9df33f3f88c53f3f3f3f3f8be13f3f42
EUC-JP ???俉??伊??暎??闇??彛??吟??B 001111110011111100111111100011111011000110111011001111110011111110110000110010110011111100111111110110101111010100111111001111111011000011000111001111110011111110001111101111001111101000111111001111111011011011100011001111110011111101000010 3f3f3f8fb1bb3f3fb0cb3f3fdaf53f3fb0c73f3f8fbcfa3f3fb6e33f3f42
UTF-8 琉뗥깋俉딆춺伊싳깄暎노쨱闇됭뜇彛뽰쪡吟끿껙B 11101111101001111000110011101011100101111010010111101010101110011000101111100100101111111000100111101011100101001000011011101100101101101011101011100100101111001000101011101100100010111011001111101010101110011000010011100110100110101000111011101011100001011011100011101100101010001011000111101001100101111000011111101011100100001010110111101011100111001000011111100101101111011001101111101011101111011011000011101100101010101010000111100101100100001001111111101011100000011011111111101010101110111001100101000010 efa78ceb97a5eab98be4bf89eb9486ecb6bae4bc8aec8bb3eab984e69a8eeb85b8eca8b1e99787eb90adeb9c87e5bd9bebbdb0ecaaa1e5909feb81bfeabb9942
UHC 琉뗥깋俉딆춺伊싳깄暎노쨱闇됭뜇彛뽰쪡吟끿껙B 11101011101001001000101111100101100000111000100111100111111010111000101011101100101011011001011011101100101001011001101011101100100000111000010111100111101100101011001111101011101001001000101111100100111000011000100111101000100011011000101011101100101011011001011011101100101001011001101011101011111000011000010111100111101100101011001101000010 eba48be58389e7eb8aecad96eca59aec8385e7b2b3eba48be4e189e88d8aecad96eca59aebe185e7b2b342

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)