To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 亦??耀??濡??節???〓?穩????? 100101101001001000111111001111111001011101110011001111110011111110010100010001110011111100111111100100001101111100111111001111110011111110000001101011000011111111100010011100100011111100111111001111110011111100111111 96923f3f97733f3f94473f3f90df3f3f3f81ac3fe2723f3f3f3f3f
EUC-JP 亦??耀??濡??節???〓?穩????? 110010111111001000111111001111111100110111010100001111110011111111000111101010000011111100111111110000001110000100111111001111110011111110100010101011100011111111100011110100110011111100111111001111110011111100111111 cbf23f3fcdd43f3fc7a83f3fc0e13f3f3fa2ae3fe3d33f3f3f3f3f
UTF-8 亦낅젔耀붾쑙濡쏅챶節귞븦廬〓젩穩녿젫溜믧쪛 111001001011101010100110111010111000001010000101111011001010000010010100111010001000000010000000111010111011011010111110111011001001000110011001111001101011111110100001111011001000111110000101111011001011000110110110111001111010111110000000111010101011011110011110111010111011100010100110111011111010011010000010111000111000000010010011111011001010000010101001111001111010100110101001111010111000010110111111111011001010000010101011111011111010011110001011111010111010111110100111111011001010101010011011 e4baa6eb8285eca094e88080ebb6beec9199e6bfa1ec8f85ecb1b6e7af80eab79eebb8a6efa682e38093eca0a9e7a9a9eb85bfeca0abefa78bebafa7ecaa9b
UHC 亦낅젔耀붾쑙濡쏅챶節귞븦廬〓젩穩녿젫溜믧쪛 111001101011001010000101111010111010000010010010111010011010010110010100111010111001110010111000111010111010000110011011111010111010101010000011111011111011110110000010111001111001010110001111111001011111111010100001111010111010000010100001111010001011000110000110111010111010000010100011111010101111111010010010111010011010010110010100 e6b285eba092e9a594eb9cb8eba19bebaa83efbd82e7958fe5fea1eba0a1e8b186eba0a3eafe92e9a594

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)