What bit string is a character (or short string) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????[?????????[^ 001111110011111100111111001111110011111100111111001111110011111100111111010110110011111100111111001111110011111100111111001111110011111100111111001111110101101101011110 3f3f3f3f3f3f3f3f3f5b3f3f3f3f3f3f3f3f3f5b5e
SJIS-WIN 夜??誼??揄ъ?[夜??誼??揄ъ?[^ 1001011011101001001111110011111110001011011000100011111100111111100111011000100110000100100011000011111101011011100101101110100100111111001111111000101101100010001111110011111110011101100010011000010010001100001111110101101101011110 96e93f3f8b623f3f9d89848c3f5b96e93f3f8b623f3f9d89848c3f5b5e
EUC-JP 夜??誼??揄ъ?[夜??誼??揄ъ?[^ 1100110011101011001111110011111110110101110000110011111100111111110110011110100110100111111011000011111101011011110011001110101100111111001111111011010111000011001111110011111111011001111010011010011111101100001111110101101101011110 cceb3f3fb5c33f3fd9e9a7ec3f5bcceb3f3fb5c33f3fd9e9a7ec3f5b5e
UTF-8 夜껊벉誼쎾쮦揄ъ젢[夜껊벉誼쎾쮦揄ъ젢[^ 11100101101001001001110011101010101110111000101011101011101100101000100111101000101010101011110011101100100011101011111011101100101011101010011011100110100011111000010011010001100010101110110010100000101000100101101111100101101001001001110011101010101110111000101011101011101100101000100111101000101010101011110011101100100011101011111011101100101011101010011011100110100011111000010011010001100010101110110010100000101000100101101101011110 e5a49ceabb8aebb289e8aabcec8ebeecaea6e68f84d18aeca0a25be5a49ceabb8aebb289e8aabcec8ebeecaea6e68f84d18aeca0a25b5e
UHC 夜껊벉誼쎾쮦揄ъ젢[夜껊벉誼쎾쮦揄ъ젢[^ 111001011010100010000011111010111001001110101100111010111111111010011011111001011010100010000011111010101111000110101100111011001010000010011011010110111110010110101000100000111110101110010011101011001110101111111110100110111110010110101000100000111110101011110001101011001110110010100000100110110101101101011110 e5a883eb93acebfe9be5a883eaf1aceca09b5be5a883eb93acebfe9be5a883eaf1aceca09b5b5e

SJIS-Win, EUC-JP: Legacy character encodings used mainly for Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP).
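
The conversion shown in the table can be approximated offline. The following is a minimal Python sketch, not the code behind this page: the function names `to_bit_strings` and `convert` are hypothetical, the codec names are Python's own (`cp932` for SJIS-Win as noted above, and `cp949` assumed to correspond to UHC), and characters that a charset cannot represent are assumed to be replaced by "?" (0x3f), which is consistent with the 3f bytes in the table.

```python
# Map the labels used in the table to Python codec names.
# "cp949" for UHC is an assumption about what this page means by UHC.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def to_bit_strings(text, codec):
    """Encode text with codec and return (binary string, hex string)."""
    # errors="replace" substitutes "?" for characters the charset lacks.
    data = text.encode(codec, errors="replace")
    binary = "".join(f"{byte:08b}" for byte in data)  # 8 bits per byte
    return binary, data.hex()


def convert(text):
    """Return {charset label: (binary, hex)} for every charset above."""
    return {label: to_bit_strings(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (binary, hexadecimal) in convert("夜[^").items():
        print(label, binary, hexadecimal)
```

Because each byte is always written as eight binary digits and two hexadecimal digits, the binary column is exactly four times as long as the hexadecimal column for any input.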