To what bitstring a character(s) is encoded in each character set?

Input one character or short letters and click "Convert."


(UTF-8)
Charset Character Bit string (binary) Bit String (hexadecimal)
ISO-8859-1 ????????k}????????k{^ 001111110011111100111111001111110011111100111111001111110011111101101011011111010011111100111111001111110011111100111111001111110011111100111111011010110111101101011110 3f3f3f3f3f3f3f3f6b7d3f3f3f3f3f3f3f3f6b7b5e
SJIS-WIN 航??肛??埇?k}航??肛??埇?k{^ 100011010111000100111111001111111110001111101000001111110011111111111010100110100011111101101011011111011000110101110001001111110011111111100011111010000011111100111111111110101001101000111111011010110111101101011110 8d713f3fe3e83f3ffa9a3f6b7d8d713f3fe3e83f3ffa9a3f6b7b5e
EUC-JP 航??肛??埇?k}航??肛??埇?k{^ 1011100111010010001111110011111111100110111010100011111100111111100011111011011111100111001111110110101101111101101110011101001000111111001111111110011011101010001111110011111110001111101101111110011100111111011010110111101101011110 b9d23f3fe6ea3f3f8fb7e73f6b7db9d23f3fe6ea3f3f8fb7e73f6b7b5e
UTF-8 航쒙슭肛됵슬埇풞k}航쒙슭肛됵슬埇풞k{^ 1110100010001000101010101110110010010010100110011110110010001010101011011110100010000010100110111110101110010000101101011110110010001010101011001110010110011111100001111110110110010010100111100110101101111101111010001000100010101010111011001001001010011001111011001000101010101101111010001000001010011011111010111001000010110101111011001000101010101100111001011001111110000111111011011001001010011110011010110111101101011110 e888aaec9299ec8aade8829beb90b5ec8aace59f87ed929e6b7de888aaec9299ec8aade8829beb90b5ec8aace59f87ed929e6b7b5e
UHC 航쒙슭肛됵슬埇풞k}航쒙슭肛됵슬埇풞k{^ 11111001111111101001110011101111101111011011111011111001111111011000100111101111101111011011110111101001101110011011111101000001011010110111110111111001111111101001110011101111101111011011111011111001111111011000100111101111101111011011110111101001101110011011111101000001011010110111101101011110 f9fe9cefbdbef9fd89efbdbde9b9bf416b7df9fe9cefbdbef9fd89efbdbde9b9bf416b7b5e

SJIS-Win,EUC-JP: Classic charsets mainly used as Japanese encoding set on Windows(SJIS-Win=CP932) and UNIX(EUC-JP)