To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????????????????? 001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN ????ゅ?筍る?繹??愿??亘???矜筍 0011111100111111001111110011111110000010111000110011111111100010101000011000001011101001001111111110001110001000001111110011111110011100110000110011111100111111100110000110101000111111001111110011111111100001111000001110001010100001 3f3f3f3f82e33fe2a182e93fe3883f3f9cc33f3f986a3f3f3fe1e0e2a1
EUC-JP ????ゅ?筍る?繹??愿??亘???矜筍 0011111100111111001111110011111110100100111001010011111111100100101000111010010011101011001111111110010111101000001111110011111111011000110001010011111100111111110011111100101100111111001111110011111111100010111000101110010010100011 3f3f3f3fa4e53fe4a3a4eb3fe5e83f3fd8c53f3fcfcb3f3f3fe2e2e4a3
UTF-8 麗멥굥留ゅ쳞筍る연繹먮봽愿졾땔亘留뚩굢矜筍 111011111010011010001000111010111010100110100101111010101011010110100101111011111010011110001101111000111000001010000101111011001011001110011110111001111010110110001101111000111000001010001011111011001001011110110000111001111011100110111001111010111010100010101110111010111011010010111101111001101000010010111111111011001010000110111110111010111001010110010100111001001011101010011000111011111010011110001101111010111001101010101001111010101011010110100010111001111001111110011100111001111010110110001101 efa688eba9a5eab5a5efa78de38285ecb39ee7ad8de3828bec97b0e7b9b9eba8aeebb4bde684bfeca1beeb9594e4ba98efa78deb9aa9eab5a2e79f9ce7ad8d
UHC 麗멥굥留ゅ쳞筍る연繹먮봽愿졾땔亘留뚩굢矜筍 111001101011000010111000111000111000001010001011111010111010011110101010111001011010101110000100111000101110110010101010111010111011111110101100111001101011101010010000111010111001010010000100111010101011010010100000111001011011011010101010110100001110011011101011101001111000110011101000100000101000100111010000111010001110001011101100 e6b0b8e3828beba7aae5ab84e2ecaaebbface6ba90eb9484eab4a0e5b6aad0e6eba78ce88289d0e8e2ec

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)