Which bit string is a character (or short string of characters) encoded to in each character set?

Enter a single character or a short string and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit string (hexadecimal)
ISO-8859-1 ?????????????????????? 00111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111001111110011111100111111 3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f3f
SJIS-WIN 暗?????筍ル?筌l?暗?????筍ル?筌 10001000110000110011111100111111001111110011111100111111111000101010000110000011100010110011111111100010101000111000001010001100001111111000100011000011001111110011111100111111001111110011111111100010101000011000001110001011001111111110001010100011 88c33f3f3f3f3fe2a1838b3fe2a3828c3f88c33f3f3f3f3fe2a1838b3fe2a3
EUC-JP 暗?????筍ル?筌l?暗?????筍ル?筌 10110000110001010011111100111111001111110011111100111111111001001010001110100101111010110011111111100100101001011010001111101100001111111011000011000101001111110011111100111111001111110011111111100100101000111010010111101011001111111110010010100101 b0c53f3f3f3f3fe4a3a5eb3fe4a5a3ec3fb0c53f3f3f3f3fe4a3a5eb3fe4a5
UTF-8 暗산램杻쒙쭕筍ル뙕筌l쬃暗산램杻쒙쭕筍ル뙕筌 111001101001101010010111111011001000001010110000111010111001111010101000111011111010011110001000111011001001001010011001111011001010110110010101111001111010110110001101111000111000001110101011111010111001100110010101111001111010110110001100111011111011110110001100111011001010110010000011111001101001101010010111111011001000001010110000111010111001111010101000111011111010011110001000111011001001001010011001111011001010110110010101111001111010110110001101111000111000001110101011111010111001100110010101111001111010110110001100 e69a97ec82b0eb9ea8efa788ec9299ecad95e7ad8de383abeb9995e7ad8cefbd8cecac83e69a97ec82b0eb9ea8efa788ec9299ecad95e7ad8de383abeb9995e7ad8c
UHC 暗산램杻쒙쭕筍ル뙕筌l쬃暗산램杻쒙쭕筍ル뙕筌 1110010011011110101110111110101010110111101001011110101011110100100111001110111110100111100011011110001011101100101010111110101110001100100110101110111110100111101000111110110010100110100110101110010011011110101110111110101010110111101001011110101011110100100111001110111110100111100011011110001011101100101010111110101110001100100110101110111110100111 e4debbeab7a5eaf49cefa78de2ecabeb8c9aefa7a3eca69ae4debbeab7a5eaf49cefa78de2ecabeb8c9aefa7

SJIS-Win, EUC-JP: Legacy character sets used mainly to encode Japanese text, on Windows (SJIS-Win = CP932) and on UNIX (EUC-JP) respectively.
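
The conversion shown in the table can be approximated offline. The following is a minimal sketch, not the tool's actual implementation. It assumes that Python's codecs `iso-8859-1`, `cp932`, `euc_jp`, `utf-8`, and `cp949` correspond to the labels ISO-8859-1, SJIS-WIN, EUC-JP, UTF-8, and UHC, and that characters a charset cannot represent are replaced with `?` (0x3f), as the ISO-8859-1 row suggests. The names `CHARSETS`, `encode_row`, and `encode_table` are hypothetical.

```python
# Assumed mapping from the table's charset labels to Python codec names.
CHARSETS = {
    "ISO-8859-1": "iso-8859-1",
    "SJIS-WIN": "cp932",
    "EUC-JP": "euc_jp",
    "UTF-8": "utf-8",
    "UHC": "cp949",
}


def encode_row(text, codec):
    """Return (binary string, hex string) for text encoded with codec.

    Characters the codec cannot represent become '?' (0x3f); this is an
    assumption about how the tool handles unencodable input.
    """
    raw = text.encode(codec, errors="replace")
    bits = "".join(f"{byte:08b}" for byte in raw)  # 8 bits per byte, zero-padded
    return bits, raw.hex()


def encode_table(text):
    """Encode text in every charset listed in CHARSETS."""
    return {label: encode_row(text, codec) for label, codec in CHARSETS.items()}


if __name__ == "__main__":
    for label, (bits, hexstr) in encode_table("暗").items():
        print(label, bits, hexstr)
```

For the single character 暗, this sketch yields `e69a97` for UTF-8, `88c3` for SJIS-WIN, and `b0c5` for EUC-JP, which agrees with the leading bytes of the corresponding table rows.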